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CRYSTALLOGRAPHIC STRUCTURE OF THE ANDROGEN RECEPTOR 

LIGAND BINDING DOMAIN 

Field of Invention 

The present invention relates to compositions and crystals of 
5 androgen receptor ligand binding domain optionally in complex with its 
ligand. This invention also relates to methods of using the structure 
coordinates of the androgen receptor ligand binding domain /ligand 
complex to solve the structure of similar or homologous proteins or 
protein complexes. This invention also relates to methods for designing 

10 and selecting ligands that bind to the androgen receptor and methods of 
using such ligands. 
Background of the Invention 

The androgen receptor (AR) is a member of the steroid nuclear- 
receptor superfamily of ligand-dependent transcription factors. The 

1 5 binding of androgen to AR initiates the gene activation required for male 
sex development. 

AR is an important target primarily in two drug discovery areas. 
In oncology drug discovery, inhibitors (antagonists or partial antagonists) 
of androgen receptor function are useful for treatment of anti-androgen 

20 refractory prostate cancer. In metabolic diseases drug discovery, agonists 
or partial agonists to the androgen receptor in muscle are useful to treat 
age-related diseases. 

As with the other members of the steroid receptor family, AR has 
several functional domains including a DNA binding domain (DBD), and 

25 a 261 residue ligand-binding domain (LBD) (Mw = 30,245 Da) which 

contains the androgen binding site, and is responsible for switching on 
the androgen function. 

Development of synthetic ligands that specifically bind to 
androgen receptors has been largely guided by trial and error method of 

30 drug design despite the importance of the androgen receptor in 

physiological processes and medical conditions such as prostate cancer 
and modulation of reproductive organ modulation. Previously, new 
ligands specific for androgen receptors were discovered in the absence of 
information on the three dimensional structure of the androgen receptor 

35 with a bound ligand. Before the present invention, researchers were 
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essentially discovering androgen receptor ligands by probing in the dark 
and without the ability to visualize how the amino acids of the androgen 
receptor held a ligand in its grasp. 

Consequently, it would be advantageous to devise methods and 
5 compositions for reducing the time required to discover ligands to the 
androgen receptor, synthesize such compounds and administer such 
compounds to organisms to modulate physiological processes regulated 
by the androgen receptor. 

The cDNA and amino acid sequences of human and rat 
10 androgen receptors have been described (Proc. Natl. 

Acad. Sci. U.S.A. 1988 85: 7211-7215). However, there have been no 
crystals reported of any androgen receptor. Thus, x-ray crystallographic 
analysis of such proteins has not been possible. 

We have discovered the first crystal structure of the androgen 
15 receptor ligand binding domain (AR-LBD). Our understanding or the 
androgen receptor structure has allowed for the determination of the 
ligand binding site for selective androgen receptor modulators (SARMs). 
Summary of the Invention 

The present invention provides crystals of AR-LBD and crystals 
20 of an AR-LBD bound to a ligand, i.e. an AR-LBD/ AR-LBD ligand 

complex. Most preferably the AR-LBD ligand is dihydrotestosterone 
(DHT). Thus, the present invention is directed to a crystal of an AR-LBD 
comprising: 

1) an AR-LBD and an AR-LBD ligand or 
25 2) an AR-LBD without an AR-LBD ligand; 

wherein said crystal diffracts to at least 3 angstrom resolution and has a 
crystal stability within 5% of its unit cell dimensions. The crystal of AR 
or AR-LBD preferably has at least 200 amino acid and preferably 
comprises amino acid sequence 672 to 917 of rat AR or the AR amino 
30 acid sequence 672 to 917 of human AR. 

The present invention also provides the structure coordinates of 
the AR-LBD/ AR-LBD ligand complex. The complete coordinates are 
listed in Table A. 

The present invention also provides a method for determining at 
35 least a portion of the three-dimensional structure of molecules or 

molecular complexes which contain at least some structurally similar 
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features to the androgen receptor ligand binding domain. It is preferred 
that these molecules or molecular complexes comprise at least a part of 
the ligand binding site defined by structure coordinates of AR-LBD 
amino acids V685, L700, L701, S702, S703, L704, N705, E706, L707, 
5 G708, E709, Q711, A735, 1737, Q738, Y739, S740, W741, M742, G743, 
L744, M745, V746, F747, A748, M749, G750, R752, Y763, F764, A765, 
L768, F770, M780, M787, 1869, L873, H874, F876, T877 and F878 
according to Table A, or a mutant or homologue thereof. Since the 
protein sequences for rat and human AR LBD are identical, the human 
10 numbering system was used herein. 

The present invention also provides a machine-readable data 
storage medium which comprises a data storage material encoded with 
machine readable data defined by the structure coordinates of an AR- 
LBD/ AR-LBD ligand or ligand complex according to Table A or a 
1 5 homologue of the complex. 

The present invention further provides a binding site in AR-LBD 
for an AR-LBD ligand as well as methods for designing or selecting AR 
modulators including agonists, partial agonists, antagonists, partial 
antagonists and/ or selective androgen receptor modulators (SARMs) of 
20 AR using information about the crystal structures disclosed herein. 
Brief Description of the Drawing 

Figure 1 is a ribbon style drawing of the Androgen Receptor 
LBD. The substrate DHT is shown as a ball-and-stick figure. 

Figure 2 is a comparison of the androgen receptor ligand 
25 binding domain with progesterone receptor ligand binding domain. 

Figure 3 provides three views of the omit electron density map of 
dihydro testosterone (DHT) in the hormone-binding site of AR-LBD. There 
are hydrogen bonds between the steroid and the side chains of Arg 752 
and Asn 705. 

30 Figure 4 is a comparison of the binding of dihydrotestosterone to 

AR-LBD (top) and of progesterone to PR- LBD (bottom). Note that an 
additional hydrogen bond interaction would be possible if both the 
sidechains of both N719 and the progesterone were flipped. 
Detailed Description of the Invention 

35 The first crystal structure of the androgen receptor ligand 

binding domain (AR-LBD) has been determined to 2.0 A resolution. 
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Ciystals of rat AR-LBD were grown from precipitating solutions 
containing 0.9 M Sodium Tartrate, 0.1 M Na Hepes, pH 7.5. X-ray 
diffraction from the crystals have the symmetry and systematic absences 
of the orthorhombic space group P2l2l2l with unit cell dimensions a = 
5 56.03 A , b = 66.27 A, c= 70.38 A, and one molecule per asymmetric 

unit (Mathews Volume = 2. 16 A 3 Da- 1 ). The structure was determined by 
the method of molecular replacement using the structure of the 
Progesterone Receptor LBD (PR-LBD) as the search model. 

The complex of AR-LBD with dihydrotestosterone (DHT) shows 
10 the mode of binding of the steroid to the receptor in the agonist 
conformation. 

O Chiral 




Dihydrotestosterone 

15 

The following abbreviations are used throughout the application: 

A = Ala = Alanine 

V = Val = Valine 
20 L = Leu = Leucine 

I = He = Isoleucine 

P = Pro = proline 

F = Phe = phenylalanine 

W = Trp = Tryptophan 
25 M = Met = Methionine 

G = Gly = Glycine 

S = Ser = Serine 

T = Thr = Threonine 

C = Cys = Cysteine 
30 Y = Tyr = Tyrosine 

N =Asn = Asparagine 
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Q 


=Gln = 


= Glutamine 


D 


= Asp 


= Aspartic Acid 


E 


= GIu 


= Glutamic Acid 


K 


= Lys 


= Lysine 


R 


= Arg 


= Arginine 


H 


= His 


= Histidine 



"Atom type" refers to the element whose coordinates have been 
determined. Elements are defined by the first letter in the column. 

10 

"X, Y, Z" crystallographically define the atomic position determined for 
each atom. 

"B" is a thermal factor that measures movement of the atom around its 
15 atomic center. 

"Occ" is an occupancy factor that refers to the fraction of the molecules 
in which each atom occupies the position specified by the coordinates. A 
value of "1" indicates that each atom has the same conformation, i.e., the 
20 same position, in all molecules of the crystal. 

Additional definitions are set forth in the specification where necessary. 

The androgen receptor (AR) described herein is intended to 
include any polypeptide which has the activity of the naturally occurring 

25 androgen receptor . The AR and AR-LBD contemplated herein includes 
all vertebrate and mammalian forms such as rat, mouse, pig, goat, 
horse, guinea pig, rabbit, monkey, orangutan and human. Such terms 
also include polypeptides that differ from naturally occurring forms of 
AR and AR-LBD by having amino acid deletions, substitutions, and 

30 additions, but which retain the activity of AR and AR-LBD, respectively. 
The crystal structure of the invention preferably contains at least 25%, 
more preferably at least 50%, more preferably at least 75%, more 
preferably at least 90%, more preferably at least 95%, more preferably at 
least 99%, and most preferably all of the coordinates listed in Table A. 

35 The crystal of the AR-LBD/ AR-LBD ligand of the invention preferably 

has the following unit cell dimensions in angstroms: a = 56.03 ± 5% , b 
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= 66.27 ± 5%, c = 70.38 ± 5% and an orthorhombic space group 
P2l2l2l. 

The AR-LBD ligand of this invention is any peptide, peptide 
mimetic or nonpeptide, including small organic molecules, that is 
5 capable of acting as a ligand for AR-LBD. In a preferred embodiment, 
the AR-LBD ligand is an AR modulator. By "AR modulator" it is meant 
an agonist or activator, a partial agonist or partial activator, an 
antagonist or inhibitor, or a partial antagonist or partial inhibitor which 
demonstrates tissue specific activations of the AR. Such compounds are 

10 also referred to herein as SARMs (selective androgen receptor 

modulators) of the AR-LBD. Examples of preferred agonists include 
androgens such as dihydro testosterone. 

The peptides referred to herein (e.g., AR, AR-LBD, and the like) 
may be produced by any well-known method, including synthetic 

15 methods, such as solid phase, liquid phase and combination solid 

phase/ liquid phase syntheses; recombinant DNA methods, including 
cDNA cloning, optionally combined with site directed mutagenesis; 
and/or purification of the natural products, optionally combined with 
enzymatic cleavage methods to produce fragments of naturally occurring 

20 Advantageously, the crystallizable compositions provided by this 

invention are amenable to x-ray crystallography. Thus, this invention 
also provides the three-dimensional structure of the AR-LBD /AR-LBD 
ligand complex, particularly the complex of rat AR-LBD with 
dihydro testosterone. 

25 The three-dimensional structure of the AR-LBD / 

dihydrotestosterone complex of this invention is defined by a set of 
structure coordinates as set forth in Table A. The term "structure 
coordinates" refers to Cartesian coordinates derived from mathematical 
equations related to the patterns obtained on diffraction of a 

30 monochromatic beam of X-rays by the atoms (scattering centers) of an 
androgen receptor/ dihydrotestosterone complex in crystal form. The 
diffraction data are used to calculate an electron density map of the 
repeating unit of the crystal. The electron density maps are then used to 
establish the positions of the individual atoms of the complex. 

35 Those of skill in the art will understand that a set of structure 

coordinates for a receptor or receptor/ ligand complex or a portion 
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thereof , is a relative set of points that define a shape in three 
dimensions. Thus, it is possible that an entirely different set of 
coordinates could define a similar or identical shape. Moreover, slight 
variations in the individual coordinates will have little effect on overall 
5 shape. 

The variations in coordinates discussed above may be generated 
because of mathematical manipulations of the structure coordinates. For 
example, the structure coordinates set forth in Table A could be 
manipulated by ciystallographic permutations of the structure 
10 coordinates, fractionalization of the structure coordinates; integer 

additions or subtractions to sets of the structure coordinates, inversion 
. of the structure coordinates or any combination of the above. 

Alternatively, modifications in the crystal structure due to 
mutations, additions, substitutions, and/ or deletions of amino acids, or 
1 5 other changes in any of the components that make up the crystal could 
also account for variations in structure coordinates. If such variations 
are within an acceptable standard error as compared to the original 
coordinates, the resulting three-dimensional shape is considered to be 
the same. 

20 Various computational analyses are therefore necessary to 

determine whether a molecule or molecular complex or a portion thereof 
is sufficiently similar to all or parts of the androgen 

receptor/ dihydro testosterone described above as to be considered the 
same. Such analyses may be carried out in current software 

25 applications, such as the Molecular Similarity application of QUANTA 
(Molecular Simulations Inc., San Diego, CA) version 4.1, and as 
described in the accompanying User's Guide. 

The Molecular Similarity application permits comparisons 
between different structures, different conformations of the same 

30 structure, and different parts of the same structure. The procedure used 
in Molecular Similarity to compare structures is divided into four steps: 
1) load the structures to be compared; 2) define the atom equivalences in 
these structures; 3) perform a fitting operation; and 4) analyze the 
results. 

35 Each structure is identified by a name. One structure is 

identified as the target (i.e., the fixed structure); all remaining structures 
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are working structures (i.e., moving structures). Since atom equivalency 
within QUANTA is defined by user input, for the purpose of this 
invention we will define equivalent atoms as protein backbone atoms (N, 
Cs, C and O) for all conserved residues between the two structures being 
5 compared. We will also consider only rigid fitting operations. 

When a rigid fitting method is used, the working structure is 
translated and rotated to obtain an optimum fit with the target 
structure. The fitting operation uses an algorithm that computes the 
optimum translation and rotation to be applied to the moving structure, 
1 0 such that the root mean square difference of the fit over the specified 

pairs of equivalent atom is an absolute minimum. This number, given in 
angstroms, is reported by QUANTA. 

For the purpose of this invention, any molecule or molecular 
complex that has a root mean square deviation of conserved residue 
15 backbone atoms (N, Ca, C, O) of less than 1.5 A when superimposed on 
the relevant backbone atoms described by structure coordinates listed in 
Table A are considered identical. More preferably, the root mean square 
deviation is less than 1.0 A. In a preferred embodiment of the present 
invention, the molecule or molecular complex comprises at least a 
20 portion of the ligand binding site defined by structure coordinates of AR- 
LBD amino acids V685, L700, L701, S702, S703, L704, N705, E706, 
L707, G708, E709, Q711, A735, 1737, Q738, Y739, S740, W741, M742, 
G743, L744, M745, V746, F747, A748, M749, G750, R752, Y763, F764, 
A765, L768, F770, M780, M787, 1869, L873, H874, F876, T877 and 
25 F878 according to Table A, or a mutant or homologue of said molecule or 
molecular complex. More preferred are molecules or molecular 
complexes comprising all or any part of the ligand binding site defined by 
structure coordinates of AR-LBD amino acids N705, Q711, R752, F764 
and T877 according to Table A, or a mutant or homologue of said 
30 molecule or molecular complex. Since the protein sequences for rat and 
human AR LBD are identical, the human numbering system has been 
used herein. 

The term "complex" or "molecular complex" means AR-LBD or a 
mutant or homologue of AR-LBD in a covalent or non-covalent 
35 association with a chemical entity or compound. 
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For purposes of the present invention, by "at least a portion of 
it is meant all or any part of the ligand binding site defined by these 
structure coordinates. 

By "mutant or homologue" as used herein it is meant a molecule 
5 or molecular complex having a similar structure and/ or sequences to 
AR-LBD. By "similar structure" it is meant a mutant or homologue 
having a binding pocket that has a root mean square deviation from the 
backbone atoms of said AR-LBD amino acids of not more than 1.5 
Angstroms. By "similar sequence" it is meant a mutant or homologue 
10 having 30%, or more preferably 75%, identity with AR-LBD. 

The term "root mean square deviation" means the square root of 
the arithmetic mean of the squares of the deviations from the mean. It is 
a way to express the deviation or variation from a trend or object. For 
purposes of this invention, the "root mean square deviation" defines the 
1 5 variation in the backbone of a protein or protein complex from the 
relevant portion of the backbone of the AR portion of the complex as 
defined by the structure coordinates described herein. 

Once the structure coordinates of a protein crystal have been 
determined they are useful in solving the structures of other crystals. 
20 Thus, in accordance with the present invention, the structure 

coordinates of an androgen receptor/ dihydro testosterone complex, and 
in particular a complex, and portions thereof is stored in a machine- 
readable storage medium. Such data may be used for a variety of 
purposes, such as drug discovery and x-ray ciystallographic analysis or 
25 protein crystal. 

Accordingly, in one embodiment of this invention is provided a 
machine-readable data storage medium comprising a data storage 
material encoded with the structure coordinates set forth in Table A. 
One embodiment utilizes System 10 as disclosed in WO 
30 98/ 1 1 134, the disclosure of which is incorporated herein by reference in 
its entirety 

For the first time, the present invention permits the use of 
structure-based or rational drug design techniques to design, select, and 
synthesize chemical entities, including inhibitory and stimulatory 
35 compounds that are capable of binding to AR-LBD, or any portion 
thereof. 
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One particularly useful drug design technique enabled by this 
invention is iterative drug design. Iterative drug design is a method for 
optimizing associations between a protein and a compound by 
determining and evaluating the three-dimensional structures of 
5 successive sets of protein/ compound complexes. 

Those of skill in the art will realize that association of natural 
ligands or substrates with the binding pockets of their corresponding 
receptors or enzymes is the basis of many biological mechanisms of 
action. The term "binding pocket" as used herein, refers to a region of a 

10 molecule or molecular complex, that, as a result of its shape, favorably 
associates with another chemical entity or compound. Similarly, many 
drugs exert their biological effects through association with the binding 
pockets of receptors and enzymes. Such associations may occur with all 
or any parts of the binding pockets. An understanding of such 

1 5 associations will help lead to the design of drugs having more favorable 
associations with their target receptor or enzyme, and thus, improved 
biological effects. Therefore, this information is valuable in designing 
potential ligands or inhibitors of receptors or enzymes, such as inhibitors 
of AR. 

20 The term "associating with" refers to a condition of proximity 

between chemical entities or compounds, or portions thereof. The 
association may be non-covalent — wherein the juxtaposition is 
energetically favored by hydrogen bonding or van der Waals or 
electrostatic interactions — or it may be covalent. 

25 In iterative drug design, crystals of a series of 

protein/ compound complexes are obtained and then the three- 
dimensional structures of each complex is solved. Such an approach 
provides insight into the association between the proteins and 
compounds of each complex. This is accomplished by selecting 

30 compounds with inhibitory activity, obtaining crystals of this new 

protein/ compound complex, solving the three dimensional structure of 
the complex, and comparing the associations between the new 
protein/ compound complex and previously solved protein /compound 
complexes. By observing how changes in the compound affected the 

35 protein/ compound associations, these associations may be optimized. 
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In some cases, iterative drug design is carried out by forming 
successive protein-compound complexes and then crystallizing each new 
complex. Alternatively, a pre-formed protein crystal is soaked in the 
presence of an inhibitor, thereby forming a protein /compound complex 
5 and obviating the need to crystallize each individual protein/ compound 
complex. 

As used herein, the term "soaked" refers to a process in which 
the crystal is transferred to a solution containing the compound of 
interest. 

10 The structure coordinates set forth in Table A can also be used 

to aid in obtaining structural information about another crystallized 
molecule or molecular complex. This may be achieved by any of a 
number of well-known techniques, including molecular replacement. 

The structure coordinates set forth in Table A can also be used 

1 5 for determining at least a portion of the three-dimensional structure of 
molecules or molecular complexes which contain at least some 
structurally similar features to AR. In particular, structural information 
about another crystallized molecule or molecular complex may be 
obtained. This may be achieved by any of a number of well-known 

20 techniques, including molecular replacement. 

Therefore, in another embodiment this invention provides a 
method of utilizing molecular replacement to obtain structural 
information about a crystallized molecule or molecular complex whose 
structure is unknown comprising the steps of: 

25 a) generating an X-ray diffraction pattern from said crystallized molecule 
or molecular complex; 

b) applying at least a portion of the structure coordinates set forth in 
Table A to the X-ray diffraction pattern to generate a three-dimensional 
electron density map of the molecule or molecular complex whose 

30 structure is unknown; and 

c) using all or a portion of the structure coordinates set forth in Table A 
to generate homology models of AR-LBD or any other nuclear hormone 
receptor ligand binding domain. 

Preferably, the crystallized molecule or molecular complex is 
35 obtained by soaking a crystal of this invention in a solution. 



By using molecular replacement, all or part of the structure 
coordinates of the AR-LBD/ AR-LBD ligand complex provided by this 
invention or molecular complex whose structure is unknown more 
quickly and efficiently than attempting to determine such information ab 
5 initio. 

Molecular replacement provides an accurate estimation of the 
phases for an unknown structure. Phases are a factor in equations used 
to solve crystal structures that can not be determined directly. Obtaining 
accurate values for the phases, by methods other than molecular 

10 replacement, is a time-consuming process that involves iterative cycles of 
approximations and refinements and greatly hinders the solution of 
crystal structures. However, when the crystal structure of a protein 
containing at least a homologous portion has been solved, the phases 
from the known structure provide a satisfactory estimate of the phases 

15 for the unknown structure. 

Thus, this method involves generating a preliminary model of a 
molecule or molecular complex whose structure coordinates are 
unknown, by orienting and positioning the relevant portion of the AR- 
LBD /AR-LBD ligand complex according to Table A within the unit cell of 

20 the crystal of the unknown molecule or molecular complex so as best to 
account for the observed X-ray diffraction pattern of the crystal of the 
molecule or molecular complex whose structure is unknown. Phases can 
then be calculated from this model and combined with the observed X- 
ray diffraction pattern amplitudes to generate an electron density map of 

25 the structure whose coordinates are unknown. This, in turn, can be 
subjected to any well-known model building and structure refinement 
techniques to provide a final, accurate structure of the unknown 
crystallized molecule or molecular complex [E. Lattman, "Use of the 
Rotation and Translation Functions", in Meth. Enzymol., 115, pp. 55-77 

30 (1985); M. G. Rossmann, ed., "The Molecular Replacement Method", Int. 
Sci. Rev. Set., No. 13, Gordon & Breach, New York (1972)]. 

The structure of any portion of any crystallized molecule or 
molecular complex, or mutant, homologue or orphan receptor that is 
sufficiently homologous to any portion of the AR-LBD /AR-LBD ligand 

35 complex can be solved by this method. Along with the aforementioned 
AR, there also exist a number of AR for which the activating or 
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deactivating ligands may not be characterized. These proteins are 
classified as AR due to strong sequence homology to other AR, and are 
known as orphan receptors. 

The structure coordinates are also particularly useful to solve 
5 the structure of crystals of AR-LBD/ AR-LBD ligand co-complexed with a 
variety of chemical entities. This approach enables the determination of 
the optimal sites for interaction between chemical entities, including 
interaction of candidate AR inhibitors with the complex. For example, 
high resolution X-ray diffraction data collected from crystals exposed to 
10 different types of solvent allows the determination of where each type of 
solvent molecule resides. Small molecules that bind tightly to these sites 
can then be designed and synthesized and tested for their AR inhibition 
activity. 

All of the complexes referred to above may be studied using well- 

15 known X-ray diffraction techniques and may be refined versus 1.5-3 A 
resolution X-ray data to an R value of about 0.20 or less using computer 
software, such as X-PLOR [Yale University, 1992, distributed by 
Molecular Simulations, Inc.; see, e.g., Blundell & Johnson, supra; Meth. 
Enzymol., vol. 114 8s 115, H. W. Wyckoff et al., eds., Academic Press 

20 (1985)]. This information may thus be used to optimize known AR 

agonists, partial agonists, antagonists, partial antagonists and SARMS, 
and more importantly, to design new AR agonists /antagonists. 

Accordingly, the present invention is also directed to a binding 
site in AR-LBD for an AR-LBD ligand in which a portion of AR-LBD 

25 ligand is in van der Walls contact or hydrogen bonding contact with at 

least one of the following residues: V685, L700, L701, S702, S703, L704, 
N705, E706, L707, G708, E709, Q711, A735, 1737, Q738, Y739, S740, 
W741, M742, G743, L744, M745, V746, F747, A748, M749, G750, R752, 
Y763, F764, A765, L768, F770, M780, M787, 1869, L873, H874, F876, 

30 T877, F878, L880, L881, V889, F891, P892, E893, M894, M895, A896, 
E897, 1898, 1899, S900, V901, Q902, V903, P904 or 1906 of AR-LBD. 
For purposes of this invention, by AR-LBD binding site it is also meant to 
include mutants or homologues thereof. In a preferred embodiment, the 
mutants or homologues have at least 25% identity, more preferably 50% 

35 identity, more preferably 75% identity, and most preferably 95% identity 
to residues V685, L700, L701, S702, S703, L704, N705, E706, L707, 



G708, E709, Q711, A735, 1737, Q738, Y739, S740, W741, M742, G743, 
L744, M745, V746, F747, A748, M749, G750, R752, Y763, F764, A765, 
U76S, F770, M780, M787, 1869, L873, H874, F876, T877, F878, L880, 
L881, V889, F891, P892, E893, M894, M895, A896, E897, 1898, 1899, 
5 S900, V901, Q902, V903, P904 or 1906 of AR-LBD binding sites. 

The present invention is also directed to a machine-readable 
data storage medium, comprising a data storage material encoded with 
machine readable data, wherein the data is defined by the structure 
coordinates of an AR-LBD/ AR-LBD ligand according to Table A or a 

10 homologue of said complex, wherein said homologue comprises 

backbone atoms that have a root mean square deviation from the 
backbone atoms of the complex of not more than 3.0A. Preferably, the 
machine-readable data storage medium, according to the invention, is 
wherein said molecule or molecular complex is defined by the set of 

1 5 structure coordinates for AR-LBD/ AR-LBD ligand according to Table A, 
or a homologue of said molecule or molecular complex, said homologue 
having a root mean square deviation from the backbone atoms of said 
amino acids of not more than 2.0 A. In a preferred embodiment the 
machine-readable data storage medium comprises a data storage 

20 material encoded with a first set of machine readable data comprising a 
Fourier transform of at least a portion of the structural coordinates for 
an AR-LBD /AR-LBD ligand according to Table A; which, when combined 
with a second set of machine readable data comprising an X-ray 
diffraction pattern of a molecule or molecular complex of unknown 

25 structure, using a machine programmed with instructions for using said 
first set of data and said second set of data, can determine at least a 
portion of the structure coordinates corresponding to the second set of 
machine readable data, said first set of data and said second set of data. 
The present invention also provides for computational methods 

30 using three dimensional models of the androgen receptor that are based 
on crystals of AR-LBD/ AR-LBD ligand complex. Generally, the 
computational method of designing an androgen receptor ligand 
determines which amino acid or amino acids of the AR-LBD interact with 
a chemical moiety (at least one) of the ligand using a three dimensional 

35 model of a crystallized protein comprising the AR-LBD with a bound 
ligand, and selecting a chemical modification (at least one) of the 
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chemical moiety to produce a second chemical moiety with a structure 
that either decreases or increases an interaction between the interacting 
amino acid and the second chemical moiety compared to the interaction 
between the interacting amino acid and the corresponding chemical 
5 moiety on the natural hormone. 

The computational methods of the present invention are for 
designing androgen receptor synthetic ligands using such crystal and 
three dimensional structural information to generate synthetic ligands 
that modulate the conformational changes of the androgen receptor's 

10 LBD. These computational methods are particularly useful in designing 
an agonist, partial agonist, antagonist or partial antagonist or SARMs to 
the androgen receptor, wherein the agonist, partial agonist, antagonist or 
partial antagonist or SARMS has an extended moiety that prevents any 
one of a number of ligand-induced molecular events that alter the 

15 receptor's influence on the regulation of gene expression, such as 

preventing the normal coordination of the activation domain observed for 
a naturally occurring ligand or other ligands that mimic the naturally 
occurring ligand, such as an agonist. As described herein, synthetic 
ligands of the androgen receptor will be useful in modulating androgen 

20 receptor activity in a variety of medical conditions. 

AR is known to comprise various domains as follows: 

1) a variable amino- terminal domain; 

2) a highly conserved DNA-binding domain (DBD); and 

3) a less conserved carboxyl-terminal ligand-binding domain (LBD). 

25 This modularity permits different domains of each protein to separately 
accomplish different functions, although the domains can influence each 
other. The separate function of a domain is usually preserved when a 
particular domain is isolated from the remainder of the protein. Using 
conventional protein chemistry techniques a modular domain can 

30 sometimes be separated from the parent protein. Using conventional 
molecular biology techniques each domain can usually be separately 
expressed with its original function intact or chimerics of two different 
nuclear receptors can be constructed, wherein the chimetics retain the 
properties of the individual functional domains of the respective nuclear 

35 receptors from which the chimerica were generated. 
Amino Terminal Domain 
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The amino terminal domain is the least conserved of the three 
domains. This domain is involved in transcriptional activation and in 
some cases its uniqueness may dictate selective receptor-DNA binding 
and activation of target genes by specific receptor isoforms. This domain 
can display synergistic and antagonistic interactions with the domains of 
the LBD. For example, studies with mutated and/ or deleted receptors 
show positive cooperativity of the amino and carboxy terminal domains. 
In some cases, deletion of either of these domains will abolish the 
receptor's transcriptional activation functions. 
DNA-Binding Domain 

The DBD is the most conserved domain. The DBD contains two 
perpendicularly oriented a-helixes that extend from the base of the first 
and second zinc fingers. The two zinc fingers function in concert along 
with non-zinc finger residues to direct nuclear receptors to specific target 
sites on DNA and to align receptor homodimer or heterodimer interfaces. 
Various amino acids in DBD influence spacing between two half-sites for 
receptor dimer binding. 
Ligand or AR Binding Domain 

The LBD is the second most highly conserved domain. Whereas 
integrity of several different LBD sub-domains is important for ligand 
binding, truncated molecules containing only the LBD retain normal 
ligand-binding activity. This domain also participates in other functions, 
including dimerization, nuclear translocation and transcriptional 
activation. Importantly, this domain is the binding site for ligands, i.e. 
AR modulators, and undergoes ligand-induced conformational changes 
as detailed herein. 

As described herein, the LBD of AR can be expressed, 
crystallized, its three dimensional structure determined with a ligand 
bound (either using crystal data from the same receptor or a different 
receptor or a combination thereof), and computational methods used to 
design ligands to its LBD, particularly ligands that contain an extension 
moiety that coordinates the activation domain of AR. 

Once a computationally designed ligand (CDL) is synthesized, it 
can be tested using assays to establish its activity as an agonist, partial 
agonist, antagonist or partial antagonist or SARM, and affinity, as 
described herein. After such testing, the CDLs can be further refined by 
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generating LBD crystals with a CDL bound to the LBD. The structure of 
the CDL can then be further refined using the chemical modification 
methods described herein for three dimensional models to improve the 
activity or affinity of the CDL and make second generation CDLs with 
5 improved properties, such as that of a super agonist or antagonist. 

Typically AR-LBD is purified to homogeneity for crystallization. 
Purity of AR-LBD is measured with SDS-PAGE, mass spectrometry and 
hydrophobic HPLC. The purified AR for crystallization should be at least 
97.5 % pure or 97.5%, preferably at least 99.0% pure or 99.0% pure, 
10 more preferably at least 99.5% pure or 99.5% pure. 

Initially purification of the unliganded receptor can be obtained 
by conventional techniques, such as hydrophobic interaction 
chromatography (HPLC), ion exchange chromatography (HPLC), and 
heparin affinity chromatography. 
1 5 To achieve higher purification for improved crystals of AR, it will 

be desirable to ligand shift purify the nuclear receptor using a column 
that separates the receptor according to charge, such as an ion exchange 
or hydrophobic interaction column, and then bind the eluted receptor 
with a ligand, especially an agonist or partial agonist. The ligand induces 
20 a change in the receptor's surface charge such that when re- 

chromatographed on the same column, the receptor then elutes at the 
position of the liganded receptor are removed by the original column run 
with the unliganded receptor. Usually saturating concentrations of 
ligand are used in the column and the protein can be preincubated with 
25 the ligand prior to passing it over the column. 

More recently developed methods involve engineering a "tag" 
such as with histidine placed on the end of the protein, such as on the 
amino terminus, and then using a nickle chelation column for 
purification, Janknecht R., Proc. Natl. Acad.Sci. USA Vol 88:8972-8976 
30 (1991) incorporated by reference . 

To determine the three dimensional structure of a AR-LBD, it is 
desirable to co-crystalize the LBD with a corresponding LBD ligand. 

Typically purified AR-LBD is equilibrated at a saturating 
concentration of ligand at a temperature that preserves the integrity of 
35 the protein. Ligand equilibration can be established between 2 and 37° 
C, although the receptor tends to be more stable in the 2-20° C range. 
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Preferably crystals are made with the hanging drop methods. 
Regulated temperature control is desirable to improve crystal stability 
and quality. Temperatures between 4 and 25°C are generally used and it 
is often preferable to test crystallization over a range of temperatures. It 
5 is preferable to use crystallization temperatures from 18 to 25°C, more 
preferably 20 to 23°C, and most preferably 22°C. 

Ligands that interact with AR can act as an agonist, partial 
agonist, antagonist or partial antagonist or SARM based on what ligand- 
induced conformational changes take place. 
1 0 Agonists or partial agonists induce changes in receptors that 

place them in an active conformation that allows them to influence 
transcription, either positively or negatively. There may be several 
different ligand-induced changes in the receptor's conformation. 

Antagonists or partial antagonists bind to receptors, but fail to 
1 5 induce conformational changes that alter the receptor's transcriptional 
regulatory properties or physiologically telcram conformations. Binding 
of an antagonist or partial antagonist can also block the binding and 
therefore the actions of an agonist or partial agonist. 

Partial agonists, or partial antagonists, bind to receptors and 
20 induce only part of the changes in the receptors that are induced by 

agonists or antagonists, respectively. The differences can be qualitative 
or quantitative. Thus, a partial agonist or partial antagonist may induce 
some of the conformation changes induced by agonists or antagonists, 
respectively, but not others, or it may only induce certain changes to a 
25 limited extent. 

As described herein, the unliganded receptor is in a 
configuration that is either inactive, has some activity or has repressor 
activity. Binding of agonist ligands induces conformational changes in 
the receptor such that the receptor becomes more active, either to 
30 stimulate or repress the expression of genes. The receptors may also 

have non-genomic actions, some of the known types of changes and/ or 
the sequelae of these are listed herein. 

Heat shock protein binding domains present a region for binding 
to the LBD and can be modulated by the binding of a ligand to the LBD. 
35 Consequently, an extended chemical moiety (or more) from the ligand 
that stabilizes the binding or comact of the heat shock protein binding 
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domain with the LBD can be designed. Typically such chemical 
moieties will extend past and away from the molecular recognition 
domain on the ligand and usually past the buried binding cavity of the 
ligand. 

Ligand binding by the receptor is a dynamic process, which 
regulates receptor function by inducing an altered conformation. 

The three-dimensional structure of the liganded AR receptor will 
greatly aid in the development of new AR synthetic ligands. In addition, 
AR is overall well suited to modern methods including three-dimensional 
structure elucidation and combinatorial chemistry such as those 
disclosed in EP 335 628, U.S. patent 5,463,564, which are incorporated 
herein by reference. Computer programs that use crystallography data 
when practicing the present invention will enable the rational design of 
ligand to AR. Programs such as RASMOL can be used with the atomic 
coordinates from crystals generated by practicing the invention or used 
to practice the invention by generating three dimensional models and/ or 
determining the structures involved in ligand binding. Computer 
programs such as INSIGHT and GRASP allow for further manipulation 
and the ability to introduce new structures. In addition, high throughput 
binding and bioactivity assays can be devised using purified recombinant 
protein and modern reporter gene transcription assays described herein 
and known in the art in order to refine the activity of a CDL. 

Generally the computational method of designing an AR 
synthetic ligand comprises two steps: 

1) determining which amino acid or amino acids of AR- LBD interacts 
with a first chemical moiety (at least one) of the ligand using a three 
dimensional model of a crystallized protein comprising an AR-LBD with a 
bound ligand; and 

2) selecting a chemical modifications (at least one) of the first chemical 
moiety to produce a second chemical moiety with a structure to either 
decrease or increase an interaction between the interacting amino acid 
and the second chemical moiety compared to the interaction between the 
interacting amino acid and the first chemical moiety. 

Preferably the method is carried out wherein said three dimensional 
model is generated by comparing isomorphous ligand derivatives to 
produce improved phasing. Further preferred is wherein said method 
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comprises determining a change in interaction between said interacting 
amino acid and said ligand after chemical modification of said first 
chemical moiety, especially wherein said three dimensional model is 
generated by comparing isomorphous ligand derivatives to produce 
5 improved phasing. Also preferred is wherein said selecting uses said 
first chemical moiety that interacts with at least one of the interacting 
amino acids V685, L700, L701, S702, S703, L704, N705, E706, L707, 
G708, E709, Q711, A735, 1737, Q738, Y739, S740, W741, M742, G743, 
L744, M745, V746, F747, A748, M749, G750, R752, Y763, F764, A765, 

10 L768', F770, M780, M787, 1869, L873, H874, F876, T877, F878, L880, 
L88l! V889, F891, P892, E893, M894, M895, A896, E897, 1898, 1899, 
S900, V901, Q902, V903, P904 or 1906. 

As shown herein, interacting amino acids form contacts with the 
ligand and the center of the atoms of the interacting amino acids are 

1 5 usually 2 to 4 angstroms away from the center of the atoms of the 
ligand. Generally these distances are determined by computer as 
discussed herein and in McRee 1993, however distances can be 
determined manually once the three dimensional model is made. See 
also Wagner et al., Nature 378(6558):670-697 (1995) for stereochemical 

20 figures of -three dimensional models. More commonly, the atoms of the 
ligand and the atoms of interacting amino acids are 3 to 4 angstroms 
apart. The invention can be practiced by repeating steps I and 2 to refine 
the fit of the ligand to the LBD and to determine a better ligand, such as 
an agonist, partial agonist, antagonist or partial antagonist or SARM. 

25 The three dimensional model of AR can be represented in two 

dimensions to determine which amino acids contact the ligand and to 
select a position on the ligand for chemical modification and changing 
the interaction with a particular amino acid compared to that before 
chemical modification. The chemical modification may be made using a 

30 computer, manually using a two dimensional representation of the three 
dimensional model or by chemically synthesizing the ligand. The ligand 
can also interact with distant amino acids after chemical modification of 
the ligand to create a new ligand. Distant amino acids are generally not 
in contact with the ligand before chemical modification. A chemical 

35 modification can change the structure of the ligand to make as new 
ligand that interacts with a distant amino acid usually at least 4.5 
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angstroms away from the ligand, preferably wherein said first chemical 
moiety is 6 to 12 angstroms away from a distant amino acid. Often 
distant amino acids will not line the surface of the binding cavity for the 
ligand, they are too far away from the ligand to be part of a pocket or 
5 binding cavity. The interaction between a LBD amino acid and an atom 
of an LBD ligand can be made by any force or attraction described in 
nature. Usually the interaction between the atom of the amino acid and 
the ligand will be the result of a hydrogen bonding interaction, charge 
interaction, hydrophobic interaction, van der Waals interaction or dipole 

10 interaction. In the case of the hydrophobic interaction it is recognized 
that this is not a per se interaction between the amino acid and ligand, 
but rather the usual result, in part, of the repulsion of water or other 
hydrophilic group from a hydrophobic surface. Reducing or enhancing 
the interaction of the LBD and a ligand can be measured by calculating 

1 5 or testing binding energies, computationally or using thermodynamic or 
kinetic methods as known in the art. 

Chemical modifications will often enhance or reduce interactions 
of an atom of a LBD amino acid and an atom of an LBD ligand. Steric 
hindrance will be a common means of changing the interaction of the 

20 LBD binding cavity with the activation domain. 

The present invention also provides methods for identifying 
compounds that modulate androgen receptor activity. Various methods 
or combinations thereof can be used to identify these compounds. For 
example, test compounds can be modeled that fit spatially into the AR- 

25 LBD as defined by structure coordinates according to Table A, or using a 
three-dimensional structural model of AR-LBD, mutant AR-LBD or AR- 
LBD homolog or portion thereof. Structure coordinates of the ligand 
binding site, in particular amino acids V685, L700, L701, S702, S703, 
L704, N705, E706, L707, G708, E709, Q711, A735, 1737, Q738, Y739, 

30 S740, W741, M742, G743, L744, M745, V746, F747, A748, M749, G750, 
R752, Y763, F764, A765, L768, F770, M780, M787, 1869, L873, H874, 
F876, T877, F878, L880, L881, V889, F891, P892, E893, M894, M895, 
A896, E897, 1898, 1899, S900, V901, Q902, V903, P904 or 1906 can also 
be used to identify structural and chemical features. Identified 

35 structural or chemical features can then be employed to design or select 
compounds as potential AR modulators. By structural and chemical 
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features it is meant to include, but is not limited to, van der Waals 
interactions, hydrogen bonding interactions, charge interaction, 
hydrophobic bonding interaction, hydrophobic interaction and dipole 
interaction. Alternatively, or in conjunction, the three-dimensional 
5 structural model or the ligand binding site can be employed to design or 
select compounds as potential AR modulators. Compounds identified as 
potential AR modulators can then be synthesized and screened in an 
assay characterized by binding of a test compound to the AR-LBD. 
Examples of assays useful in screening of potential AR modulators 
10 include, but are not limited to, screening in silico, in vitro assays and 
high throughput assays. Finally, these methods may also involve 
modifying or replacing one or more amino acids from AR-LBD such as 
V685, L700, L701, S702, S703, L704, N705, E706, L707, G708, E709, 
Q711, A735, 1737, Q738, Y739, S740, W741, M742, G743, L744, M745, 
15 V746, F747, A748, M749, G750, R752, Y763, F764, A765, L768, F770, 
M780, M787, 1869, L873, H874, F876, T877, F878, L880, L881, V889, 
F891, P892, E893, M894, M895, A896, E897, 1898, 1899, S900, V901, 
Q902, V903, P904 or 1906 of AR-LBD according to Table A. 

A preferred method of the invention can be described as a 
20 computational method of designing an androgen receptor antagonist 
from an androgen receptor agonist comprising: 

1) determining a structure of a molecular recognition domain of 
said agonist using a three dimensional model of a crystallized 
protein comprising an AR-LBD, and 
25 2) selecting at least one chemical modification of said agonist 

that provides a ligand structure that extends beyond a 
binding site for said agonist and in the direction of at least 
one protein domain important in AR biological function. 
Another preferred method of the invention can be described as a 
30 computational method of designing a selective androgen receptor 

modulator such as an androgen receptor super agonist or antagonist 
comprising: 

1) determining at least one interacting amino acid of an AR-LBD 
that interacts with at least one first chemical moiety of said 
35 ligand using a three dimensional model of a crystallized 

protein comprising AR-LBD with a bound ligand, and 
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2) selecting at least one chemical modification of said first 

chemical moiety to produce a second chemical moiety with a 
structure to reduce or enhance an interaction between said 
interacting amino acid and said second chemical moiety 
5 compared to said interaction between said interacting amino 

acid and said first chemical moiety. 

However, as will be understood by those of skill in the art upon this 
disclosure, other structure based design methods can be used. Various 
10 computational structure based design methods have been disclosed in 
the art. 

For example, a number computer modeling systems are 
available in which the sequence of the AR-LBD and the AR-LBD 
structure (i.e., atomic coordinates of AR-LBD and/or the atomic 

15 coordinates of the active site, the bond and dihedral angles, and 

distances between atoms in the active site such as provided in Table A) 
can be input. This computer system then generates the structural 
details of the site in which a potential AR modulator binds so that 
complementary structural details of the potential modulators can be 

20 determined. Design in these modeling systems is generally based upon 
the compound being capable of physically and structurally associating 
with AR-LBD. In addition, the compound must be able to assume a 
conformation that allows it to associate with AR-LBD. Some modeling 
systems estimate the potential inhibitory or binding effect of a potential 

25 AR modulator prior to actual synthesis and testing. 

Methods for screening chemical entities or fragments for their 
ability to associate with AR-LBD are also well known. Often these 
methods begin by visual inspection of the active site on the computer 
screen. Selected fragments or chemical entities are then positioned with 

30 the AR-LBD. Docking is accomplished using software such as QUANTA 
and SYBYL, following by energy minimization and molecular dynamics 
with standard molecular mechanic forcefields such as CHARMM and 
AMBER. Examples of computer programs which assist in the selection 
of chemical fragment or chemical entities useful in the present invention 

35 include, but are not limited to, GRID (Goodford , P.J. J. Med. Chem. 

1985 28:849-857), AUTODOCK (Goodsell, D.S. and Olsen, A.J. Proteins, 
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Structure, Functions, and Genetics 1990 8:195-202), and DOCK (Kunts 
et al. J. Mol. Biol. 1982 161:269-288). 

Upon selection of preferred chemical entities or fragments, their 
relationship to each other and AR-ABD can be visualized and the entities 
5 or fragments can be assembled into a single potential modulator. 

Programs useful in assembling the individual chemical entities include, 
but are not limited to CAVEAT (Bartlett et al. Molecular Recognition in 
Chemical and Biological Problems Special Publication, Royal Chem. Soc. 
78, 182-196 (1989) ) and 3D Database systems (Martin, Y.C. J. Med. 
10 Chem. 1992 35:2145-2154). 

Alternatively, compounds may be designed de novo using either 
an empty active site or optionally including some portion of a known 
inhibitor. Methods of this type of design include, but are not limited to 
LUDI (Bohm H-J, J. Comp. Aid. Molec. Design 1992 6:61-78) and 
15 LeapFrog (Tripos Associates, St. Louis. MO). 

The present invention is also directed to an AR-LBD selective 
androgen receptor modulator (SARM), in particular an agonist or 
antagonist or partial agonist or partial antagonist, identified by a 
computational process of the invention. 
20 The present invention is further directed to a method for treating 

prostate cancer comprising administering an effective amount of an AR 
modulator, preferably an antagonist or partial antagonist, identified by a 
computational process of the invention. 

The present invention is also direct to a method for treating an 
25 age related disease comprising administering an effective amount of an 
AR modulator, preferably an agonist or partial agonist, identified by a 
computational process of the invention, preferably wherein said age 
related disease is osteoporosis, muscle wasting or loss of libido. 

Compounds identified as agonists, partial agonists, antagonists, 
30 partial antagonists or SARMs by the methods disclosed herein which are 
active when given orally can be formulated as liquids for example syrups, 
suspensions or emulsions, tablets, capsules and lozenges. A liquid 
composition will generally consist of a suspension or solution of the 
compound in a suitable liquid carrier(s), for example ethanol, glycerin, 
35 sorbitol, non-aqueous solvent such as polyethylene glycol, oils or water, 
with a suspending agent, preservative, surfactant, wetting agent, 
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flavoring or coloring agent. Alternatively, a liquid formulation can be 
prepared from a reconstitutable powder. For example a powder 
containing active compound, suspending agent, sucrose and a sweetener 
can be reconstituted with water to form a suspension; and a syrup can 
5 be prepared from a powder containing active ingredient, sucrose and a 
sweetener. A composition in the form of a tablet can be prepared using 
any suitable pharmaceutical carrier(s) routinely used for preparing solid 
compositions. Examples of such carriers include magnesium stearate, 
starch, lactose, sucrose, microcrystalline cellulose, binders, for example 

10 polyvinylpyrrolidone. The tablet can also be provided with a color film 
coating, or color included as part of the carrier (s). In addition, active 
compound can be formulated in a controlled release dosage form as a 
tablet comprising a hydrophilic or hydrophobic matrix. A composition in 
the form of a capsule can be prepared using routine encapsulation 

1 5 procedures, for example by incorporation of active compound and 

excipients into a hard gelatin capsule. Alternatively, a semi-solid matrix 
of active compound and high molecular weight polyethylene glycol can be 
prepared and filled into a hard gelatin capsule; or a solution of active 
compound in polyethylene glycol or a suspension in edible oil, for 

20 example liquid paraffin or fractionated coconut oil can be prepared and 
filled into a soft gelatin capsule. Compounds identified by the processes 
described herein which are active when given parenterally can be 
formulated for intramuscular or intravenous administration. A typical 
composition for intra-muscular administration will consist of a 

25 suspension or solution of active ingredient in an oil, for example arachis 
oil or sesame oil. A typical composition for intravenous administration 
will consist of a sterile isotonic aqueous solution containing, for example 
active ingredient, dextrose, sodium chloride, a co-solvent, for example 
polyethylene glycol and, optionally, a chelating agent, for example 

30 ethylenediaminetetracetic acid and an anti-oxidant, for example, sodium 
metabisulphite. Alternatively, the solution can be freeze dried and then 
reconstituted with a suitable solvent just prior to 
administration. Identified compounds which are active on rectal 
administration can be formulated as suppositories. A typical suppository 

35 formulation will generally consist of active ingredient with a binding 

and/ or lubricating agent such as a gelatin or cocoa butter or other low 
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melting vegetable or synthetic wax or fat. Identified compounds which 
are active on topical administration can be formulated as transdermal 
compositions. Such compositions include, for example, a backing, active 
compound reservoir, a control membrane, liner and contact adhesive. 
5 The typical daily dose of a varies according to individual needs, the 
condition to be treated and with the route of administration. Suitable 
doses are in the general range of from 0.001 to 10 mg/kg bodyweight of 
the recipient per day. 

The following examples are to illustrate the invention, but 
10 should not be interpreted as a limitation thereon. 
Examples 

Cloning, Expression and Purification of the Androgen Receptor 
Ligand-Binding Domain 

The rat androgen receptor (rAR) ligand-binding domain (LBD) 
15 cDNA, from amino Wid 646 to 901, was cloned from a rat prostate cDNA 
library (Clontech) byvPCR. The primers used were 

CATATGATTGAAGGOTATGAATGTCAACCTATCTTT (SEQ ID NO: 3) and 
TCACTGTGTGTGGAAATAGATGGG (SEQ ID NO:4). The rat AR LBD was 
expressed as a fusion protein driven by the T7 promoter of pET28b 
20 vector (Novagen) to include an N-terminal polyhistidine tag and a 
thrombin cleavage site. The replacement of T877 for A (the LNCaP 
mutation) in this rAR LBD\ expression construct was performed with the 
QuickChange Site-Directed\ Mutagenesis kit (STRATAGENE). 
Dihydrotestosterone (DHT) Vas included in the E. coli (BL21-DE3) 
25 fermentation medium at a concentration of 0.05 mM. Induction with 

0.4 mM isopropyl-p-D-thiogalkctopyranoside was allowed to proceed for 
16 hours at 20°C in M9 minirrkl media supplemented with casamino 
acids (Difco) and trace mineralsV and pellets were stored at -70 °C. A 
total of 6-9 mg of recombinant AR LBD was isolated from a 15 gram cell 
30 pellet following sonication and chromatography on a nickel-chelate resin. 
Polyhistidine-tagged AR LBD of approximately 90% purity eluted at 0.45 
M imidazole in a gradient of 0.05- IX) imidazole. This material was 
quantitatively cleaved at an engineered site for thrombin recognition, 
followed by chromatography on benzaWdine sepharose (Pharmacia) to 
35 remove the serine protease, with a 70%\recovery. The final sample 

containing the sequence Gly-Ser-His-Me\ at the N-terminus followed by 
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residues 646-901 of t&; rat (664 - 919 in the human) AR LBD protein, 
was concentrated for cLstallography to 2 mg/ml in 20 mM Tris (pH 7.5), 
0.5 M NaCl, 10% glycerol, 1 mM EDTA and 1 mM DTT. 

The sequence of the rat Androgen Receptor LBD (AR), as cloned, 
with the secondary structural features marked. For comparison, the 
aligned sequence of the Progesterone Receptor LBD (PR) is given. 
Residues involved in androgen binding are marked (*). Residues which 
are disordered in the crystal structure are underlined. The AR sequence 
is SEQ ID NO:l. The PR sequence is SEQ ID NO: 2. 

66 O^GSHMIEGYECQPIFLNVLEAIEPGVVCAGHDNNQPDSFARLLSSIjNELGE AR 
678 gqdIQLIPPLINLLMSIEPDVIYAGHDNTKPDTSSSLLTSLNQLGE PR 

I | H4/5 | 

710 RQLVHWKWAKALPGFRNLHVDDQMAVIQYSWMGLMVFAMGWRSFTNVNS AR 
724 RQLLSWKWSKSLPGFRNLHIDDQITLIQYSWMSLMVFGLGWRSYKHVSG PR 



SSSS SSS | -H6 | | H7 | |---H8-- 

760 RMLYFAPDLVFNEYRMHKSRMYSQCVRMRHLSQEFGWLQITPQEFLCMPCA AR 
774 QMLYFAPDLILNEQRMKESSFYSLCLTMWQIPQEFVKLQVSQEEFLCMKV PR 

25 -| SSS | H9 | I 

810 LLLFSIIPVDGLKNQKFFDELRMNYIKELDRIIACKRKNPTSCSRRFYQL AR 
824 LLLLNT I PLEGLRSQTQFEEMRS S YIREL I KAIGLRQKG WS S SQRFYQL PR 

— -H10/11- "I I--I I UX2 f 

30 860 TKLLDSVQPIARELHQFTFDLLIKSHMVSVDFPEMMAEIISVQVPKILSG AR 
874 TKLLDNLHDLVKQLHLYCLNTFIQSRALSVEFPEMMSEVIAAQIjPKILAG PR 

+ 

SSS 

910 KVKP I YFHTQ AR 
35 924 MVKPLLFHK PR 

Crystallization 

The AR-LBD - Dihydrotestosterone (DHT) complex was 
crystallized at 20° C by vapor diffusion in the hanging-drop mode. In the 

40 crystallization trials, the protein complex as obtained from MMB&B was 
used without any further purification. In the initial trial to obtain 
crystallization conditions, a sparse matrix crystallization screen was 
done with the Crystal Screens 1 and 2 (Hampton Research) . For each 
crystallization trial, a 2 \x\ drop was prepared by mixing 1 \x\ of purified 

45 protein (1.9 mg ml" 1 ) with an equal volume of reservoir solution. The 
reservoir contained 1.0 ml of the precipitating solution. Small crystals 
were obtained in two days from six of the drops (table 1). 
Table 1: Crystallization Conditions 
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Screen/# Precipitating Solution Result 
1/16 1.5 M Li Sulfate, 0. 1M Na Hepes, pH 7.5 Small rods 

1/29 0.8 M Na/K Tartrate, 0. 1M Na Hepes, pH 7.5 Larger rods 
1/30 2% v/v PEG 400, 2.0 M Am Sulfate, 
5 0. IM Na Hepes, pH 7.5 Small cubes 

2/20 1.6 M Mg Sulfate, 0. IM MES, pH 6.5 Small crystallites 

2/32 1.6 M Am Sulfate, 0.1 M Na CI, 

0. 1 M Hepes, pH 7.5 Small rods 

2/42 12% v/v Glycerol, 1.5 M Am Sulfate, 
10 0.1M Tris, pH 8. 5 Small rods 

The largest single crystal, measuring 0.05 mm x 0.04 mm x 0.26mm, 
was obtained from Crystal Screen 1, solution # 29 (0.8 M Na/K Tartrate, 
0.1M Na Hepes, pH 7.5). This crystal was subsequently used in the 

1 5 initial data collection run (as described below) . 

Optimization of the crystallization condition was done using a 
Cyperlab C-200 automated crystallization robotic workstation. A 
crystallization trial was performed using a 24-step linear gradient from 
0.6 M to 1.26 M Na tartrate, 100 Mm Hepes, pH 7.5 (Note: The 

20 optimization screen used sodium rather than sodium/ potassium 

tartrate). The largest, rod shaped crystal, with dimensions 0.09 mm x 
0.09 mm x 0.20mm, was obtained at 0.887 M Na Tartrate. This crystal 
was used in the second data collection run (as described below). 
Data Collection and Reduction 

25 For the initial X-ray experiment, the crystal from the initial 

crystallization screen was flash cooled by dipping it in a cryoprotectant 
solution containing the precipitating solution (0.8 M Na/K Tartrate, 0.1M 
Na Hepes, pH 7.5) with 250mm NaCl and 20% Glycerol added and then 
placed it in a cold stream at 100° K. 

30 For data set 1 , X-ray diffraction data were collected with an R- 

Axis II imaging plate detector. The radiation was generated from a 
Rigaku RU-200 rotating at 5 kw power with a fine focus filament (0.3 x 
3.0mm) was monchromated (Cu Kcc) and intensified by focusing with 
Yale mirrors (Molecular Structure Corporation). The crystal diffracted to 

35 better than 2.4 A resolution. Autoindexing and processing of the 

measured intensity data was carried out with the HKL software package 
(Otwinoski, L. (1993) in CCP4 Study Weekend, Data Collection and 
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Processing (Sawyer,L., Issacs, N., and Bailey, S., Eds.) pp 56-62, SERC 
Daresbury Laboratory, Warrington, U.K). X-ray diffraction from the 
crystals have the symmetry and systematic absences of the 
orthorhombic space group P212121 with unit cell dimensions a = 56.03 
5 a , b = 66.27 A, c= 70.38 A, and one molecule per asymmetric unit 
(Mathews Volume = 2.16 A 3 Da- 1 ). 

A second X-ray diffraction data set (data set 2) was collected at 
the IMCA-CAT beamline (sector 17ID) at the Advanced Photon Source 
synchrotron at Argonne, II. The crystal from the optimization screen 
10 described above, was flash-cooled by placing it in the reservoir solution 
(0.877 M Na Tartrate, 0.1M Na Hepes, pH 7.5) with 250mm NaCl and 
20% Glycerol added, and then placing it in a cold stream at 100° K. The 
data were collected with a Bruker 2x2 mosaic CCD detector. The crystal 
diffracted to better than 2.0 A. Autoindexing and processing of the 
15 measured intensity data was carried out with the HKL2000 software 

package (Otwinoski, L. (1993) in CCP4 Study Weekend, Data Collection 
and Processing (Sawyer,L., Issacs, N., and Bailey, S., Eds.) pp 56-62, 
SERC Daresbury Laboratory, Warrington, U.K.). The data collection and 
processing statistics for both data sets are summarized in table 2. 
20 Structure Determination (Molecular Replacement) 

The structure was determined by the method of molecular 
replacement with the program AmoRe (Navaza, J. (1994) AmoRe: an 
automated package for molecular replacement. Acta Cryst. D50, 157- 
163). The Progesterone Receptor ligand binding domain (PR-LBD), which 
25 has 54% sequence identity and 76% sequence homology to AR-LBD, was 
used as the search model. The atomic coordinates of PR-LBD (Protein 
Data Bank reference code 1A28) by Williams & Sigler (Nature 1998 393, 
39 1) were unmodified except for the removal of the ligand and solvent 
molecules. A second molecular replacement search was performed with 
30 a theoretical model for the AR-LBD provided by the MMS/CADD group 
(table 3) . The PR-LBD structure gave a slightly better solution than the 
AR-model (1.7cr vs.1.3ct above background) and was used in the 
subsequent refinement, although both structures gave equivalent results 
with no molecular interpenetration. 
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Tabl 2: Data Coll ction and Processing 


Data Set II 




Data Set 1 


Date 


5/19/99 


6/17/99 


Source/Detector 


Rigaku RU-200 


I MCA/APS 17ID 


Detector 


R-axis II 


Bruker 2x2 


Wavelength 


Cu Ka(1.54 A) 


1.00 A 


Frames 


364 


400 


A<I> 


0.5° 


0.5° 


Crv^tal to nlate distance 


150 mm 


135 mm 


Time/frame 


20 min 


1 sec 


Number of Observations 


209,891 


416,207 


Data Reduction Program 


HKL 


HKL2000 


Unique reflections 


10,824 


18,308 


Reflections Used 


10,114 


16,862 




2.4 A (2.5-2.4 A) 


2.0 A (2.1-2.0 A) 




93.8% (71.6%) 


92 6 % (73 0 %) 


N/li litinlinifv 


6.3 




Mosiacity 


0.502 


0.332 


Rsym (on 1) 


4.2% (17.5%) 


10.1 %(25.6%) 


Space Group 


P212121 


P212121 


a 


56.09 A 


56.08 A 


b 


66.43 A 


65.76 A 


c 


70.54 A 


70.51 A 


Wilson B-value 


39.05 A 2 


29.26 A 2 


Values for data in the last resolution shell are given in parentheses 
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Search Model: 


Progesterone 


AR Model 




(PDBfile 1A28) 




Program Used 


AmoRe 


AMoRe 


Resolution Range 


8.0-4.0 A 


8.0-4.0 A 


Radius of Integration 


25 A 


25 A 


Number of Reflections 


2.393 


2,393 


Number of Atoms 


2,019 


2,094 


RF Correlation (2 nd solution) 


0.16 (0.12) 


0.13(0.11) 


TF Correlation (2 nd solution) 


0.31 (0.20) 


0.23 (0.14) 


TF R-factor (2 nd solution) 


49.0% (52.7%) 


52.1% (54.0%) 


Rigid Body Correlation 


0.34 


0.28 


Rigid Body R-factor 


48.1% 


50.4% 


Structure Refinement 







The structure was first refined with the initial 2.4 A data set (2a 
data, 9,818 reflections) by the method of simulated annealing with 
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program X-PLOR (Brunger, A.T., Kuriyan, J. 8b Karplus, J. (1987) 
"Crystallographic R-factor refinement by molecular dynamics", Science 
235: 458-460) in four cycles to an R-factor of 27.7%. Each refinement 
cycle consisted of a least-squares minimization, simulated annealing at 
5 3000°, and individual isotropic B-factor refinement. The first cycle, with 
the Progesterone molecular replacement model unmodified for the 
sequence differences between AR and PR, gave an R-factor of 33.8%. The 
model was then rebuilt using the AR amino acid sequence and a second 
refinement cycle gave an R-factor of 29.6%. At this stage of the 

10 refinement, the DHT molecule could be clearly seen in the difference 
electron density map. 

After each cycle, the structure was carefully examined using 
molecular computer graphics program Chain (Sack, John S. (1988) 
"CHAIN- A Crystallographic Modeling Program", J. Mol Graphics 6: 224- 

15 225) and modifications were made to the structure as needed. Several 

residues, from both the N- and C-termini of the molecule, which were not 
seen in the electron density maps were removed from the model. After 
the second cycle of refinement, the DHT was added to the model. Solvent 
molecules were added where there were 3a peaks in both the 2Fo - Fc 

20 and Fo - Fc electron density maps and removed if their B-factor went 
above 60 A 2 . After four cycles of X-PLOR refinement, a careful 
examination of the electron density showed the model to be much 
improved, although molecular refitting still needed to be done in some 
regions. The density is clear except for some of the loop regions, 

25 particularly the loop between helices I and II, which was also poorly 
modeled in the PR structure. 

Table 4: Refinement Statistics (X-PLOR) 

30 Part I: 2a data (9,818 reflections) to 2 A A 



Cycle 1 


251 residues 


No ligand 


0 waters 


R = 


33.8 % 


Cycle 2 


248 residues 


No ligand 


0 waters 


R = 


29.6% 


Cycle 3 


247 residues 


ligand 


1 8 waters 


R = 


28.3 % 


Cycle 4 


246 residues 


ligand 


40 waters 


R = 


27,7% 



35 

Part II: 2a data (15,067 reflections) to 2.0 A 

Cycle 5 246 residues ligand 32 waters R = 27.9 % 

Cycle 6 246 residues ligand 57 waters R = 26.8 % 
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Cycle 7 246 residues ligand 58 waters R = 26.7 % 

Cycle 8 246 residues ligand 1 06 waters R = 24.2% 



At this stage of the refinement, the higher resolution data collected at the 
5 APS synchrotron became available. Four additional X-PLOR refinement 
cycles were performed with the 2.0 A data set (2a data, 15,067 
reflections) following the same protocol. The final structure has an R- 
factor of 24.2% with a total of 106 solvent molecules. The final 
refinement statistics are presented in table 5. 

10 Table 5: Final Refinement Parameters 

Resolution Range 1 0.0 - 2.0 A 

Reflections 15,067 

R-factor 24.2 % 

R-free 31.2% 

15 # residues 246(672-917) 

# atoms 2118 (1991 atoms, 21 DHT, 106 waters) 
RMS deviations 

bond lengths 0.014 A 

bind angles 1.594° 

20 Improper angles 1.558° 

Average B-factors 

Protein 25.02 A 2 

DHT 14.40 A 2 

Water 30.21 A 2 

25 Wilson B-factor 29.26 A 2 

Description of the Molecule 

The structure of AR-LBD is complete from residues 671 through 
917 for the wild-type and 672 to 918 for the LNCaP mutant. Analysis of 

30 the structures with program PROCHECK showed only minor exceptions 
to the allowed geometry. In the wild-type structure, the first six residues 
of the chain (664 - 670) are not seen in the electron density and are 
probably disordered. This leaves only one residue before the initial 
residue of the first a-helix (HI) in the wild-type structure, none in the 

35 LNCaP mutant structure. On the C- terminal end, the last two residues 
(918-919) are not seen in the electron density of the wild-type 
structure, but only the last is missing in the mutant. In addition, since 
the loop between helices 9 and 10 (residues 845-850) is not well defined, 
it has been modeled as poly-alanine. 
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Folding and Packing 

As expected, the AR LBD has the same overall three- 
dimensional structure as those of the other nuclear hormone receptor 
5 LBDs. The molecule is folded into a "helical sandwich" consisting of 10 
a-helices. There are four small pieces of beta strand, forming two short 
beta-sheets; one in the core of the molecule between helices 5 and 6 near 
the ligand binding site, and the other formed by the loop between helices 
8 and 9 and the C-terminus. This latter sheet, also seen in the PR LBD 

10 structure, holds helix 12 in the closed, agonist conformation, close to 
and capping the ligand binding site. 
Lack of Dimer Formation 

Studies have indicated that the estrogen, progesterone, and 
androgen receptors all function as homodimers and that AR LBD forms 

15 dimers in solution. Thus it could be expected that the AR LBD domains 
might form homodimers in the crystal similar to those previously seen in 
the RXR-a and estrogen receptor (ER) LBD crystal structures. In the PR 
LBD structure, the two monomers in the asymmetric unit are related by 
a dyad, but the two-fold-symmetric configuration is strikingly different 

20 from that of the RXR and ER homodimers and the area buried in this 
configuration is much smaller than would be expected for stable dimer 
formation. In the AR LBD crystal, the ligand-binding domains are 
unmistakably monomeric, and there are no twofold axes relating 
domains. Moreover, the homodimer interaction seen in the structures of 

25 ER and RXR LBDs is not possible for the AR LBD, as the C-terminal tail 
is bound to the groove formed by helices 9 and 10, thereby obstructing 
the contact region between monomers in RXR and ER homodimers. 
Whether this observation reflects a non-dimeric state of the AR LBD in 
the functional AR dimer or is an artifact of the conditions used for AR 

30 LBD crystallization remains to be determined. It is noteworthy that the 
ER LBD constructs used for crystallization have been truncated to 
remove an analogous C-terminal extension. 
Comparison with Progesterone Receptor 

While there is only 55% sequence identity between AR LBD and 

35 PR LBD, there is a 77% sequence similarity, and as expected, the three- 
dimensional structures of these two LBDs are very similar with an r.m.s. 
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deviation of 1.3 A between corresponding Ca atom positions. As with PR, 
AR LBD has no helix 2, but its helix 12 is longer than those of RXR or 
TR. In the case of AR, while helices 10 and 1 1 are nearly contiguous, 
there is a proline residue at position 868 that causes a kink between the 
two helices . 

Comparison with theoretical AR model 

The theoretical AR model obtained from MMS/CADD and the AR 
structure have an r.m.s. deviation of 1.29 A for the 247 alpha carbons. 
More importantly, the hormone binding site is virtually identical with the 
exception of the side chains of Met 732(749), Leu 863(880), and Leu 
864 (881) which are in different rotomers. This causes the binding cavity 
to be more compact in the AR structure. Also, there is a flip of the side 
chain of Asn 688(705) so that the ND2 atom is in position to make a 
hydrogen bond to the carbonyl off of the D-ring. 

Table 6: Comparison of AR-LBD to PR-LBD and Theoretical model 

v Calpha Main Side Total 

ARvs. Pr 1.22(246) 1.27(983) 1.80(772) 1.53(1,755) 

ARvs. CADD 1.25(246) 1.31 (983) 2.41(971) 1.93(1,954) 



Binding of Dihydrotestosterone 

At the end of the molecular replacement procedure with the PR 
LBD structure without progesterone as search model, the largest piece of 
difference electron density, at approximately the 3a level, was found at 
the progesterone-binding site. Replacing the bound progesterone agonist 
(which has a carboxyl group at the 17-position) with a model of d- 
hydrotestosterone (DHT, which has a hydroxyl group at the 17-position) 
produced an even better fit to the difference electron density, indicating 
that DHT binds to AR LBD in an almost identical fashion to the way 
progesterone binds to PR LBD. Both agonists interact with helices 3, 5, 
and 1 1 of their respective LBDs. Ring A, which is identical in the two 
steroids, makes similar interactions with the side chains of Q71 1, M745, 
R752 (Q725, M759, R766 in PR LBD), and a conserved water molecule. 
The interactions with ring C are also similar, with close contacts to the 
mainchain of L704 (L718 in PR LBD) and sidechain of N705 (N719 in PR 
LBD). The contact between C18 and the Oyl of T877 is unique to the 
wild-type AR LBD, as the corresponding cysteinyl side chain is pointed 
away from the steroid in the PR LBD structure. 
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Since progesterone and DHT differ in the substituent on ring D, 
it is expected that interactions with respective receptors will differ in this 
region. In the AR LBD structure, N82 of N705 makes a hydrogen bond to 
the D-ring hydroxyl of DHT. A similar interaction could be made 
5 between progesterone and the PR LBD if there were a flip of both the 
steroid acetyl group and the side chain of N719. This would place the 
oxygen approximately 3.2 A from the N82 atom of Asn 719. The ligand 
contact surface area is slightly larger for progesterone in PR than for 
DHT in AR (483 vs. 448 A 2 ) but they are both considerably smaller than 

10 the ligand contact surface area in TR (559 A 2 ), PPARy (583 A 2 ), or the 
Vitamin D receptor (677 A 2 ) . 

Figure 3 shows two orthogonal views of the omit electron density 
map of dihydrotestosterone (DHT) in the hormone-binding site of AR- 
LBD. There are hydrogen bonds between the steroid and the side chains 

1 5 of Arg 752 and Asn 705. 

Table 7: Dihydrotestosterone Contacts (3.4 A) 

Hydrogen Bonds 



20 



25 



30 



35 



03 
03 


Arg 752 Nh2 
Gin 711 Ns2 


2.89 A (2.77 A) 
3.36 A (3.20 A) 


O20 
O20 


Asn 705 N52 
Thr 877 Oy1 


2.80 A (3.20 A) 
2.70 A (N/A) 


Possible Close Contacts 




C11 


Leu 704 O 


3.31 A 


C12 


Asn 705 N52 


3.07 A 


C17 


Asn 705 N52 


3.34 A 


C19 


Met 745 S5 


3.38 A 


C18 


Thr 877 Oy1 


3.07 A 



Comparison with Progesterone binding 

Comparison of the structure of DHT in the AR-LBD with the 
40 structure of progesterone in the PR-LBD (Williams, S.P. 8s Sigler, P.B. 

(1998) "Atomic Structure of Progesterone Complexed with its Receptor", 
Nature 393, 391) shows a similar mode of binding. Ring A, which is 
identical in the two steroids, makes similar interactions with the side 
chains of Q71 1, M745, R752, Q71 1 and a conserved water molecule 
45 (table 8). The interaction with ring C are also similar, with close 
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contacts to the mainchain of L704 and sidechain of N705. The contact 
from C18 to the Oyl of T877 is unique to AR-LBD, as the corresponding 
cysteine sidechain is pointed away from the steroid in the PR-LBD 
structure 

Since progesterone and DHT differ in the substitution off of ring 
D it is expected that there will be different interactions with the protein 
in this region. In the AR structure, the N52 atom of Asn 705 makes 
hydrogen bond to the D-ring hydroxyl. 

A similar interaction could be made in the PR if there were a flip 
of both the steroid carboxyl group and the side chain of N719. This 
would place the carboxyl oxygen approximately 3.2 A from the N82 atom 
of Asn 719. In AR-LBD, there is also a close contact to the side chain of 
T877 which is absent in the PR-LBD structure. 

Figure 4 shows comparison of AR and PR steroid binding 
Comparison of the binding of dihydrotestosterone to AR-LBD (top) and of 
progesterone to PR-LBD (bottom). Note that an additional hydrogen bond 
interaction would be possible if both the sidechains of both N719 and the 
progesterone were flipped. 

Table 8: Comparison of AR and PR steroid binding 



AR 



PR 



25 



30 



Ring A 
03: 



C19 



H-bond to R752 NH2 (2.9 A) 

H-bond to water (3. 5- A) 

SC of Q71 1 in different rotomer 
distance to 03 is 3.4 and 4.13 A 

Contact to M745 SD (3.4 A) 



H-bond to R766 NH2 (2.8 A) 

H-bond to water (3.1 / 3.4 A) 

Contact to SC of Gin 725 
distance to 03 is 3.2 and 3.3 A 

Similar orientation (3.5 A) 



35 



40 



C2: 

Ring C 

C11 
C12 
C18 



SC ofQ711 (3.5 A) 



LO704 O (3.3A) 

Contact to N705 N82 (3.1 A) 

Contact T877 Oy1 (3.1 A) 



different rotomer (3.2 & 3.3) 
distance to C4 is 4.1 A 



(3.5A) 

Contact to N719 051 (3.4 A) 

SC of C891 pointing away 
distance to Sy is 3.8 A 



45 



Ring D 
O20/C21 



021 in AR is close to C21 in PR (Possible flip of Carboxyl in PR?) 
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10 



N/A O20: Contact to C891 Ca (3.2 A) 

O20 H-bond N705 N52 (2.8A) C21 : Contact to N719 OD1 (3.2 A) 

O20: Contact T877 Cy1 (2.7 A) SC of C891 pointing away 

Contact N705 N52 (3.3 A) 



Structure of the Complex of DHT with the LBD of the LNCaP Mutant 

In the LNCaP mutant, T877 is replaced by an alanine residue. 
The mutant LBD structure has an r.m.s. deviation of 0.8 A compared to 
the wild-type structure, close to the expected r.m.s. deviation due to the 

1 5 estimated errors in the coordinates. In particular, the binding of DHT is 
essentially identical by wild-type and mutant LBDs except at the point of 
mutation. Here the replacement of T877 by alanine leaves additional 
space off the D-ring of DHT to accommodate a larger substituent on 
position 17. This may explain the promiscuous ability of the LNCaP 

20 mutant, unlike wild-type AR, to bind to a variety of other hormones and 
analogs like some progestins, estrogens and Cortisols that differ from 
DHT in substitution at position 17. For example, the binding of 
flutamide, estradiol, and progesterone to the LNCaP mutant can activate 
the mutant receptor. Conversely, mutation of T877 to residues with 

25 larger sidechains such as aspartic acid and lysine would be expected 
completely preclude the binding of ligands with any substituent at 
position 17 of the D-ring and such mutations have been shown to totally 
eliminate androgen binding. 
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32 . 398 
25.128 
25. 027 
24.521 
23. 721 
24.295 
25. 194 
25.835 
24 . 525 
22 .262 
21.542 
21.831 
20. 433 
20.229 
18 .896 
18 . 847 
17 . 624 
17 . 686 
16. 471 
16.446 
15.238 
19.701 
19. 984 
18.730 
17 . 935 
18.551 
17 .785 
18.027 
17 . 044 
19.242 
19. 455 
20 . 900 
21.327 
21 . 790 
21.486 
22 .547 
19 . 089 
18 . 342 
19.538 
19.212 
19. 951 
21.746 
17 . 698 
17 . 155 
17 . 003 
15.538 
14 . 987 
13. 457 
15. 349 
14 .864 
13. 881 
15. 402 
14 . 855 
15. 468 
15. 392 
15. 314 
14 .269 
14 .292 
15. 301 
13.326 
15.083 
14 . 180 
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880 
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7 . 589 
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■8 . 438 
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698 
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-3.497 
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-7 



.897 
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. 964 
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-7 . 115 
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-2 . 856 
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6.701 
6. 932 
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2 . 
3. 
2 . 

3. 

4 , 
6. 



1.00 26.64 
1.00 30.75 
1.00 31.03 
1 . 00 29.82 
1.00 29.36 
1.00 30.17 
1.00 30.12 
1.00 31.24 
1.00 31. 13 
1. 00 29.21 
1. 00 29.43 
1.00 28.58 
1.00 28.82 
1.00 27.73 
1.00 26.05 
1 . 00 25.95 
1.00 25.58 
1.00 25.08 
1 . 00 24 . 57 
1 . 00 24.77 
1.00 24 . 60 
1.00 30.06 
1 . 00 32 . 08 
1.00 30 . 74 
1.00 31.41 
1.00 32.82 
1.00 35.91 
1.00 30.50 
1.00 30.31 
1.00 29.34 
1.00 28.01 
1.00 28.50 
1.00 29.62 
1.00 32.01 
1.00 33.24 
1.00 32.18 
1.00 26.36 
1.00 26.08 
1 . 00 25.49 
1.00 24.45 
1.00 22.82 
1.00 18.24 
1.00 25.03 
1.00 25. 67 
1.00 25.25 
1.00 25.02 
1.00 25. 94 
1.00 26. 21 
1.00 26.31 
1.00 23. 91 
1 . 00 24.19 
1.00 25.02 
1 . 00 25 . 39 
1.00 26. 15 
1.00 28.30 
1.00 29.76 
1.00 32.39 
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1.00 32.09 
1.00 33.31 
1 . 00 24.71 
1.00 25.52 
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927 N MET 787 

928 CA MET 787 

929 CB MET 787 

930 CG MET 787 

931 SD MET 787 

932 CE MET 787 

933 C MET 787 

934 O MET 787 

935 N ARG 78 8 

936 CA ARG 78 8 

937 CB ARG 78 8 

938 CG ARG 7 88 

939 CD ARG 788 
94 0 NE ARG 788 

941 CZ ARG 788 

942 NH1 ARG 788 
94 3 NH2 ARG 788 

944 C ARG 788 

945 O ARG 788 

946 N HIS 789 

947 CA HIS 789 

948 CB HIS 789 

949 CG HIS 789 

950 CD2 HIS 789 

951 ND1 HIS 789 

952 CE1 HIS 789 

953 NE2 HIS 789 

954 C HIS 789 

955 O HIS 789 

956 N LEU 790 

957 CA LEU 790 

958 CB LEU 790 

959 CG LEU 790 

960 CD1 LEU 790 

961 CD2 LEU 790 

962 C LEU 790 

963 O LEU 790 

964 N SER 791 

965 CA SER 791 

966 CB SER 791 

967 OG SER 791 

968 C SER 791 

969 O SER 791 
97 0 N GLN 7 92 

971 CA GLN 7 92 

972 CB GLN 792 
97 3 CG GLN 7 92 

974 CD GLN 792 

975 OE1 GLN 792 
97 6 NE2 GLN 7 92 

977 C GLN 792 

978 O GLN 792 

979 N GLU 793 

980 CA GLU 793 

981 CB GLU 793 

982 CG GLU 793 

983 CD GLU 793 

984 OE1 GLU 793 

985 OE2 GLU 7 93 

986 C GLU 793 

987 O GLU 793 

988 N PHE 794 
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16.246 
16. 548 
18 . 018 
18 . 883 
20. 578 
21.285 
15.736 
15.387 
15. 521 
14 .738 
14 . 833 
14 . 166 
14 .217 
13. 426 
13. 899 
15. 182 
13.079 
13.312 
12 . 596 
12 . 920 
11.587 
11.377 
9. 970 
8.890 
9. 538 
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7 . 831 
11.369 
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12.413 
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10. 964 
9. 600 
9.237 
9.700 
9.421 
8 .479 
10.273 
8 . 629 

7 .702 

8 . 886 
8 .014 
8 . 460 
8 . 555 

. 383 
. 207 
. 648 
. 949 
6. 903 
9. 042 



-0. 146 
1.252 
1.528 
0. 925 

0. 861 
, 969 
. 173 
. 281 
. 752 
.499 

1 . 790 
2 . 474 

, 541 
. 996 
, 177 
. 960 
2.5 67 
2.475 
3.473 
1.339 
1.173 
-0.287 
-0 . 609 
-0.944 
-0.567 
-0.856 



1. 

2. 
3, 
1 
2 



1, 
1 . 
2 . 
1, 



7 . 

6, 
7 . 
7 , 



4 . 

3. 
2. 
2, 
1. 

7 , 
7 . 
7 



3. 
2. 
3. 
1. 

1 , 



2. 160 
2. 4 63 
2. 261 
. 314 
.788 
. 729 
. 588 
. 997 
0. 348 
-0. 625 
-1. 980 
-3. 174 
-4 .395 
-5.540 
-6.783 
-7.081 
-7 .754 
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1.00 23.53 
1.00 22.11 
1.00 20.46 
1.00 17.04 
1,00 20.46 
1.00 20.07 
1 . 00 23.39 
1 . 00 24 . 11 
1.00 24.89 
1.00 26.29 
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380 
056 
6. 375 
6.446 
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. 137 
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. 235 
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2 . 231 
2 . 394 
3.048 
4 . 218 
4 . 887 
6. 383 
6. 570 
7 .014 
3. 720 
4 . 216 
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1. 122 
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.00 28.55 
.00 32.52 



1.00 35.44 
1.00 39.11 
1.00 41.32 
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1.00 26.13 
1.00 26.50 
1. 00 26.36 
1.00 26.76 
1 . 00 29. 07 
1. 00 30.42 
1.00 31.35 
1.00 32.05 
1.00 32.56 
1.00 32 . 55 
1. 00 26.08 
1 . 00 25 . 72 
1 . 00 25 . 92 
1.00 25.41 
1 . 00 23.94 
1.00 23.32 
1 . 00 22 . 41 
1 . 00 23 . 17 
1.00 25.47 
1.00 25.06 
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1.00 26.51 
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1.00 26.02 
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1.00 26.31 
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1.00 28 . 65 
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.00 23.72 



1 
1 
1 

1 . 
1 . 

1 , 
1 , 
1 , 
1 
1 
1 
1 
1 



00 22.89 
00 23.65 
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00 27.08 
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00 21.26 
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ATOM 98 9 CA PHE 

ATOM 990 CB PHE 

ATOM 991 CG PHE 

ATOM 9 92 CD1 PHE 

5 ATOM 9 93 CD2 PHE 

ATOM 9 94 CE1 PHE 

ATOM 9 95 CE2 PHE 

ATOM 9 96 CZ PHE 

ATOM 997 C PHE 

10 ATOM 998 O PHE 

ATOM 999 N GLY 

ATOM 1000 CA GLY 

ATOM 1001 C GLY 

ATOM 1002 O GLY 

15 ATOM 1003 N TRP 

ATOM 100 4 CA TRP 

ATOM 1005 CB TRP 

ATOM 100 6 CG TRP 

ATOM 1007 CD2 TRP 

20 ATOM 1008 CE2 TRP 

ATOM 1009 CE3 TRP 

ATOM 1010 CD1 TRP 

ATOM 1011 NE1 TRP 

ATOM 1012 CZ2 TRP 

25 ATOM 1013 CZ3 TRP 

ATOM 1014 CH2 TRP 

ATOM 1015 C TRP 

ATOM 1016 O TRP 

ATOM 1017 N LEU 

30 ATOM 1018 CA LEU 

ATOM 1019 CB LEU 

ATOM 1020 CG LEU 

ATOM 1021 CD1 LEU 

ATOM 1022 CD2 LEU 

35 ATOM 1023 C LEU 

ATOM 1024 O LEU 
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ATOM 102 6 CA GLN 

ATOM 1027 CB GLN 

40 ATOM 102 8 CG GLN 

ATOM 1029 CD GLN 

ATOM 1030 OE1 GLN 

ATOM 1031 NE2 GLN 

ATOM 1032 C GLN 

45 ATOM 1033 O GLN 

ATOM 1034 N ILE 

ATOM 1035 CA ILE 

ATOM 1036 CB ILE 

ATOM 1037 CG2 ILE 

50 ATOM 1038 CGI ILE 

ATOM 1039 CD1 ILE 

ATOM 104 0 C ILE 

ATOM 1041 O ILE 
ATOM 1042 N THR 
55 ATOM 104 3 CA THR 
ATOM 104 4 CB THR 
ATOM 104 5 OG1 THR 
ATOM 104 6 CG2 THR 
ATOM 10 4 7 C THR 
60 ATOM 104 8 O THR 
ATOM 10 4 9 N PRO 
ATOM 1050 CD PRO 
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3.701 1.00 32.86 

3.633 1.00 32.89 

4.648 1.00 33.16 

2.421 1.00 33.46 

6.294 1.00 30.02 

7.147 1.00 30.51 

6.462- 1.00 28.60 

7 .648 1.00 27.76 

7 .829 1.00 26.34 

9.007 1.00 25.78 

8.046 1.00 25.57 

9.211 1.00 25.52 

7 .552 1.00 28.59 

6.496 1.00 29.81 

8.678 1.00 28.71 

8.874 1.00 28.68 

10.142 1.00 29.24 

9.815 1.00 31.49 

10.746 1.00 29.69 

9.116 1.00 28.69 

9.796 1.00 29.15 

8.565 1.00 28.18 

7 .569 1 . 00 27 . 62 
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ATOM 1051 CA PRO 801 

ATOM 1052 CB PRO 801 

ATOM 1053 CG PRO 801 

ATOM 1054 C PRO 801 

ATOM 1055 O PRO 801 

ATOM 1056 N GLN 802 

ATOM 1057 CA GLN 802 

ATOM 1058 CB GLN 802 

ATOM 1059 CG GLN 802 

ATOM 1060 CD GLN 802 

ATOM 1061 OE1 GLN 802 

ATOM 1062 NE2 GLN 802 

ATOM 1063 C GLN 802 
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ATOM 1065 N GLU 803 

ATOM 1066 CA GLU 803 

ATOM 1067 CB GLU 803 

ATOM 1068 CG GLU 803 

ATOM 1069 CD GLU 803 
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ATOM 1071 OE2 GLU 803 

ATOM 1072 C GLU 803 

ATOM 107 3 O GLU 8 03 

ATOM 107 4 N PHE 8 04 

ATOM 107 5 CA PHE 804 

ATOM 107 6 CB PHE 804 
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16.765 14.496 8.936 1.00 13.36 

17.214 13.217 8.647 1.00 12.84 

14.034 13.802 12.157 1.00 20.31 

14.807 12.939 12.564 1.00 21.30 

14.187 15.086 12.463 1.00 20.30 

15 271 15.503 13.339 1.00 20.09 

15.250 17.008 13.582 1.00 19.58 

15.552 17.834 12.330 1.00 20.47 

15.704 19.281 12.707 1.00 19.84 

16.816 17.343 11.670 1.00 19.41 

15.172 14.767 14.651 1.00 19.83 

16.142 14.205 15.106 1.00 20.77 

13.980 14.719 15.223 1.00 20.17 

13.765 14.026 16.494 1.00 21.27 

12.372 14.332 17.078 1.00 22.13 

12.142 16.017 17.706 1.00 27.50 

13.938 12.515 16.378 1.00 20.36 

14.575 11.904 17.241 1.00 20.30 

13.348 11.903 15.350 1.00 19.67 

13.491 10.458 15.160 1.00 18.10 

12.668 9.944 13.989 1.00 17.25 

11.195 9.877 14.279 1.00 16.70 

10.377 9.142 12.911 1.00 19.42 

10.144 10.560 11.908 1.00 16.21 

14.947 10.062 14.979 1.00 17.75 

15.371 9.038 15.490 1.00 18.41 

15.712 10.871 14.257 1.00 17.31 

17.116 10.592 14.054 1.00 16.27 

17.729 11.514 12.994 1.00 15.01 

19.171 11.154 12.733 1.00 14.63 

19.679 11.569 11.371 1.00 15.42 

19.422 13.053 11.092 1.00 15.64 
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20.232 

17 .857 

18 .731 
17 . 522 
18.175 
17 . 628 
17 . 989 
18 . 932 
16. 766 
16.459 
14 . 966 
14 .406 
12. 95-4 
15.176 
17 .116 
17 .213 
17 . 537 
18 .215 
18 .346 
17 . 148 
17.511 
16.744 
19.598 
20.189 
20.153 
21.455 
22.004 
23. 342 
24 .488 
23. 559 
21. 330 
22 . 282 
20. 136 
19. 859 
19. 137 
19. 818 
20. 946 
19. 349 
21. 604 
19. 991 
21. 126 
18 . 971 
18 .058 
19.255 
18 . 453 
17 . 697 
16. 640 
19. 169 
18 . 610 
20. 395 
21 . 099 
21 . 620 
22 . 222 
22 . 600 
22 . 915 
22 . 172 
23. Ill 
21. 994 
22. 913 
22. 298 
22. 175 
20. 939 



13.940 
10.726 
9.908 
11.747 
11.931 
13. 139 
10. 691 
10.207 
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8. 199 
7 . 716 
6. 780 
7 . 636 
6.447 
6.456 
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4 .856 
4 . 618 
4 . 082 
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4 . 087 
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. 620 
.498 

0. 226 
-0.086 

1. 113 
-1.245 
-1.753 

0. 187 
0. 981 
-0.700 
-0.804 
-0.099 
1.378 
-0 . 692 



11 . 928 
15.376 
15 . 677 
16. 166 
17 . 461 
18 . 155 
18 . 348 
18 . 996 
18.392 
19.186 
19.263 
20. 651 
20. 594 
21. 674 
18.722 
19.509 
17 .456 
16. 959 
15 . 438 
14 .574 
13.164 
14.746 
17 . 582 
17 . 554 
18 .084 
18 .734 
18 . 937 
19. 670 
18 .802 
20.037 
20 .098 
20 . 629 
20 . 662 
21 . 950 
22 . 821 
22 . 841 
23. 624 
22 . 036 
23. 615 
22 . 014 
22. 801 
21. 753 
22. 530 
20.709 
20.369 
19.062 
19.274 
20 .294 
19.779 
20.779 
20 .747 
19.325 
18 .706 
19.341 
17 . 953 
21 . 826 
21.802 
22 . 809 
23. 947 
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14 . 15 
16.89 
16.07 
16. 53 

17 . 68 
16.91 
19.03 
20.50 
19.36 

18 . 99 
19.09 
20 .20 
21 .29 
18 .75 
18.88 
20.78 
17 . 63 
15. 58 
14 . 70 
14 . 14 
13. 66 
13. 62 
15. 42 

17 . 27 
13. 97 
12. 94 
12. 69 

12. 03 
12 . 16 

13. 12 

12. 97 
12 . 99 

13. 55 
14 . 19 
15.20 
16.11 
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18 .37 
17 . 54 
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14 .71 
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15 . 96 
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15 . 51 
16.05 
17 . 02 

16. 07 
17 . 04 
16.75 
17 . 43 
17 . 01 
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17 .86 

18. 49 
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18 . 91 

19. 07 
17 .73 
18.04 
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ATOM 122 6 ND2 ASN 823 

ATOM 1227 C ASN 823 

ATOM 1228 O ASN 823 

ATOM 12 2 9 N GLN 824 

ATOM 12 30 CA GLN 8 2 4 

ATOM 1231 CB GLN 824 

ATOM 1232 CG GLN 82 4 

ATOM 1233 CD GLN 824 

ATOM 1234 OE1 GLN 824 

ATOM 1235 NE2 GLN 824 

ATOM 1236 C GLN 824 



20 516 -0.346 26.933 1.00 17.73 

23 302 -2.226 24.385 1.00 20.13 

22 615 -3.184 24.040 1.00 20.43 

24 392 -2.385 25.180 1.00 20.85 

25 303 -1.373 25.730 1.00 21.00 
24 805 -3.720 25.631 1.00 22.05 

26 016 -3.444 26.523 1.00 21.47 
26 554 -2.197 26.001 1.00 21.80 

23 706 -4.320 26.458 1.00 22.95 

22 988 -3.594 27.151 1.00 23.12 

23 585 -5.640 26.418 1.00 24.79 
22 544 -6.316 27.195 1.00 26.35 

22 513 -7.860 26.916 1.00 27.19 

23 864 -8.515 27.282 1.00 27.82 

21 362 -8.524 27.676 1.00 27.72 

22 742 -6.047 28.692 1.00 26.79 
21 777 -5.849 29.421 1.00 26.79 

23 992 -5.963 29.136 1.00 27.83 

24 240 -5.732 30.550 1.00 29.78 
25.406 -6.593 31.063 1.00 32.59 

26 747 -5.908 30.959 1.00 35.35 

27 117 -5.518 29.825 1.00 38.62 
27 431 -5.776 32.011 1.00 36.18 
24 377 -4.266 30.937 1.00 29.73 

24 899 -3.930 32.007 1.00 30.00 
23 839 -3.403 30.085 1.00 29.43 

23 878 -1.974 30.342 1.00 28.69 

25 216 -1.317 30.125 1.00 27.42 

26 221 -1.982 29.938 1.00 26.73 

25 208 0.010 30.135 1.00 28.29 

26 410 0.831 29.947 1.00 28.64 
26.023 2.110 29.195 1.00 28.29 
25.083 1.940 27.991 1.00 28.32 

24 046 3.022 28.031 1.00 27.27 

25 831 1.953 26.653 1.00 27.18 

26 948 1.164 31.349 1-00 28.62 

26 341 0.747 32.342 1.00 28.84 
28.060 1.897 31.441 1.00 28.49 
28.642 2.268 32.741 1.00 29.80 
29.865 3.169 32.576 1.00 30.45 

30 924 2.626 31.666 1-00 32.84 

31 517 1.345 32.194 1.00 35.27 

32 433 0.688 31.161 1.00 36.20 

33 498 1.623 30.710 1.00 37.22 
27.621 3.016 33.587 1.00 30.25 
27.353 2.655 34.731 1.00 31.02 

27 065 4.080 33.029 1.00 29.98 
26.070 4.852 33.735 1.00 29.55 
26.458 6.323 33.774 1.00 31.17 
27.832 6.544 34.350 1.00 32.55 
28.787 5.856 33.985 1.00 33.56 
27 952 7.520 35.246 1.00 34.42 
24 807 4.665 32.943 1.00 28.73 
24 476 5.473 32.091 1.00 29.00 
24*.127 3.562 33.199 1.00 27.99 
22.893 3.227 32.514 1.00 27.77 
22 590 1.731 32.738 1.00 28.13 
21 343 1.158 32.077 1.00 28.93 

21 331 1.302 30.551 1.00 30.20 

22 300 0.976 29.855 1.00 30.02 
20 211 1.775 30.028 1.00 29.72 
21.723 4.115 32.960 1.00 27.61 
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ATOM 1267 C PHE 827 

ATOM 1268 O PHE 827 

ATOM 1269 N ASP 828 

ATOM 1270 CA ASP 828 

ATOM 1271 CB ASP 828 

ATOM 1272 CG ASP 828 

ATOM 1273 OD1 ASP 828 

ATOM 1274 OD2 ASP 828 

ATOM 127 5 C ASP 828 

ATOM 127 6 O ASP 828 
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20 747 4.275 32.226 1.00 27.58 

21 833 4.752 34.122 1.00 27.13 

20 742 5.590 34.595 1.00 26.37 
20.815 5.802 36.113 1.00 28.08 
19 430 5.823 36.792 1.00 31.02 
19 493 5.693 38.335 1.00 33.41 
18.086 5.725 39.002 1.00 34.57 
17 196 4.516 38.739 1.00 35.55 
20*679 6.917 33.876 1.00 24.74 

19 625 7.518 33.799 1.00 25.30 

21 794 7.375 33.330 1.00 24 .,06 
21.830 8.646 32.597 1.00 23.44 
23 247 9.191 32.573 1.00 25.61 
23*768 9.527 33.930 1.00 28.86 

22 916 10.067 34.890 1.00 29.49 
25 091 9.284 34.268 1.00 29.08 

23 373 10.356 36.156 1.00 29.57 
25 551 9.571 35.533 1.00 29.80 
24.688 10.108 36.479 1.00 30.14 
21 344 8.463 31.178 1.00 21.78 

20 808 9.380 30.568 1.00 21.42 

21 581 7.277 30.636 1.00 20.42 
21.145 6.937 29.299 1.00 18.74 
21.814 5.644 28.857 1.00 17.62 
21*.238 5.083 27.610 1.00 16.77 
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15 921 3.972 24.074 1.00 25.50 

14 738 3.800 23.491 1.00 24.36 

13 632 4.173 24.096 1.00 25.37 

14 676 3.366 22.250 1.00 24.31 

15 086 8.667 26.302 1.00 22.46 
14 354 8.964 25.351 1.00 22.89 
14.655 8.593 27.550 1.00 22.76 
13 276 8.859 27.923 1.00 22.91 
13 126 8.762 29.429 1.00 23.88 

11 739 9.050 29.870 1.00 24.65 
11.693 9.332 31.596 1.00 29.43 
10.059 10.026 31.651 1.00 29.67 

12 879 10.262 27.513 1.00 23.47 
11.740 10.512 27.097 1.00 23.79 

13 782 11.198 27.768 1.00 23.63 
13.562 12.599 27.423 1.00 23.80 
14.676 13.482 28.013 1.00 23.64 

14 532 13.679 29.544 1.00 23.87 

15 519 13.864 30.270 1.00 23.24 

13 293 13.628 30.030 1.00 24.57 
13.403 12.761 25.905 1.00 23.94 
12 463 13.397 25.445 1.00 24.48 

14 240 12.093 25.123 1.00 23.69 
14.121 12.165 23.673 1.00 24.70 

15 340 11.532 23.007 1.00 25.39 

16 491 12.489 22.872 1.00 25.49 
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ATOM 1361 CB LEU 838 

ATOM 1362 CG LEU 8 38 

ATOM 1363 CD1 LEU 838 

ATOM 1364 CD2 LEU 838 

ATOM 1365 C LEU 838 

ATOM 13 6 6 O LEU 8 38 

ATOM 1367 N ASP 839 

ATOM 1368 CA ASP 8 39 

ATOM 1369 CB ASP 839 

ATOM 1370 CG ASP 839 

ATOM 1371 OD1 ASP 839 

ATOM 1372 OD2 ASP 839 

ATOM 137 3 C ASP 839 

ATOM 1374 O ASP 839 

ATOM 1375 N ARG 84 0 

ATOM 137 6 CA ARG 84 0 

ATOM 1377 CB ARG 840 

ATOM 137 8 CG ARG 840 

ATOM 137 9 CD ARG 840 

ATOM 1380 NE ARG 840 

ATOM 1381 CZ ARG 840 
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3 816 11.952 22.557 1.00 41.54 

3.249 12.375 23.592 1.00 43.10 

3 192 11.400 21.613 1.00 42.60 
5 338 14.293 21.187 1.00 38.83 

4 285 14.645 20.672 1.00 39.02 
6.195 15.151 21.731 1-00 40.51 
5.916 16.580 21.828 1.00 42.20 
7.032 17.289 22.610 1.00 43.32 
6.657 18.639 23.261 1.00 45.47 
6.945 19.881 22.401 1.00 46.95 
8 319 20.371 22.542 1.00 48.57 
9.066 20.823 21.533 1.00 49.57 
8 580 20.860 20.294 1.00 49.89 

10.314 21.220 21.755 1.00 49.90 

5 776 17.220 20.457 1.00 43.27 
4.860 18.004 20.232 1.00 43.50 
6.663 16.876 19.528 1.00 44.53 
6.600 17.483 18.211 1.00 46.22 
7.983 17.572 17.510 1.00 45.82 
9.044 18.078 18.463 1.00 46.40 
8 383 16.237 16.918 1.00 46.46 
8.064 16.150 15.463 1.00 45.92 
5 534 16.913 17.286 1.00 48.04 
5.472 17.272 16.109 1.00 48.90 
4.737 15.976 17.786 1.00 49.91 
3 632 15.446 16.990 1.00 51.38 
3.577 13.878 16.889 1.00 51.11 
3 917 13.445 15.482 1.00 51.75 
4.523 13.192 17.870 1-00 50.67 
4.691 11.701 17.619 1.00 49.36 
2 384 16.003 17.659 1.00 52.63 
1.509 16.551 16.999 1.00 52.38 
2.356 15.939 18.986 1.00 54.81 
1 242 16.456 19.761 1.00 57.24 
1 424 16.129 21.247 1.00 56.69 
1.215 17.962 19.557 1.00 59.28 
1.847 18.704 20.304 1.00 59.69 
0.560 18.391 18.481 1.00 61.60 
0.402 19.810 18.130 1.00 63.67 
1.766 20.536 17.979 1.00 64.05 
2.751 20.268 16.470 1.00 65.30 

-0.441 19.854 16.848 1.00 64.63 

-1.618 19.471 16.889 1.00 64.70 

0.136 20.332 15.738 1.00 65.65 

-0.545 20.374 14.439 1.00 65.96 

-0.195 21.639 13.684 1.00 65.80 

-0.079 19.165 13.644 1.00 66.49 

-0.675 18.829 12.620 1.00 66.85 

0.998 18.533 14.127 1.00 66.74 

1 601 17.343 13.511 1.00 66.90 

3.110 17.337 13.730 1.00 66.74 

0.984 16.074 14.086 1.00 66.88 

1.675 15.092 14.345 1.00 66.35 
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ATOM 1468 NH1 ARG 854 

ATOM 1469 NH2 ARG 854 

ATOM 147 0 C ARG 854 

ATOM 14 71 0 ARG 854 

ATOM 1472 N ARG 855 

ATOM 1473 CA ARG 855 

ATOM 1474 CB ARG 855 

ATOM 1475 CG ARG 855 

ATOM 1476 CD ARG 855 

ATOM 1477 NE ARG 855 
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ATOM 17 95 CG GLU 8 93 

ATOM 17 96 CD GLU 8 93 

ATOM 17 97 OE1 GLU 8 93 

ATOM 17 98 OE2 GLU 8 93 

5 ATOM 17 99 C GLU 8 93 

ATOM 1800 O GLU 893 

ATOM 1801 N MET 894 

ATOM 1802 CA MET 894 

ATOM 1803 CB MET 894 

10 ATOM 1804 CG MET 894 

ATOM 1805 SD MET 894 

ATOM 1806 CE MET 894 

ATOM 1807 C MET 894 

ATOM 1808 O MET 894 

15 ATOM 1809 N MET 895 

ATOM 1810 CA MET 895 

ATOM 1811 CB MET 895 

ATOM 1812 CG MET 895 

ATOM 1813 SD MET 895 

20 ATOM 1814 CE MET 895 

ATOM 1815 C MET 895 

ATOM 1816 O MET 8 95 

ATOM 1817 N ALA 8 96 

ATOM 1818 . CA ALA 896 

25 ATOM 1819 CB ALA 8 96 

ATOM 18 20 C ALA 8 96 

ATOM 1821 O ALA 896 

ATOM 1822 N GLU 897 

ATOM 1823 CA GLU 897 

30 ATOM 18 2 4 CB GLU 8 97 

ATOM 1825 CG GLU 8 97 

ATOM 182 6 CD GLU 8 97 

ATOM 1827 OE1 GLU 897 

ATOM 1828 OE2 GLU 8 97 

35 ATOM 182 9 C GLU 8 97 

ATOM 1830 O GLU 8 97 

ATOM 1831 N ILE 898 

ATOM 1832 CA ILE 898 

ATOM 1833 CB ILE 898 

40 ATOM 1834 CG2 ILE 898 

ATOM 1835 CGI ILE 898 

ATOM 1836 CD1 ILE 898 

ATOM 1837 C ILE 898 

ATOM 1838 O ILE 898 

45 ATOM 1839 N ILE 899 

ATOM 1840 CA ILE 899 

ATOM 1841 CB ILE 899 

ATOM 1842 CG2 ILE 899 

ATOM 18 4 3 CGI ILE 8 99 

50 ATOM 18 4 4 CD1 ILE 8 99 

ATOM 1845 C ILE 899 

ATOM 18 4 6 O ILE 8 99 

ATOM 1847 N SER 900 

ATOM 18 4 8 CA SER 900 

55 ATOM 1849 CB SER 900 

ATOM 1850 OG SER 900 

ATOM 1851 C SER 900 

ATOM 1852 O SER 900 

ATOM 1853 N VAL 901 

60 ATOM 18 54 CA VAL 901 

ATOM 18 55 CB VAL 901 

ATOM 1856 CGI VAL 901 
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42 329 4.192 11.509 1.00 33.98 

42 441 5.498 12.280 1.00 37.38 

43.356 6.292 11.955 1.00 39.69 

41 624 5.729 13.216 1.00 39.34 

39 859 3.054 10.563 1.00 26.24 

39 750 1.992 11.180 1.00 27.29 

39*052 4.078 10.782 1.00 26.07 

37 968 3.974 11.744 1.00 26.28 

37 313 5.337 11.954 1.00 28.30 

38 256 6.389 12.509 1.00 32.56 
38 847 5.925 14.144 1.00 38.01 
37 260 5.830 15.037 1.00 35.95 
36 927 2.918 11.393 1.00 24.69 
36 337 2.311 12.287 1.00 24.64 
36 662 2.743 10.102 1.00 23.64 
35 705 1.738 9.645 1.00 22.83 

35 487 1.824 8.135 1.00 21.32 

34 669 3.006 7.693 1.00 21.17 
33 044 3.064 8.432 1.00 20.56 
32 088 2.305 7.205 1.00 22.81 

36 171 0.328 10.032 1.00 22.26 

35 469 -0.383 10.714 1.00 22.26 

37 362 -0.066 9.616 1.00 22.06 
37 867 -1.378 9.953 1.00 22.36 
39 243 -1.588 9.350 1.00 22.56 
37 914 -1.581 11.460 1.00 22.96 

37 520 -2.630 11.947 1.00 23.87 
38*377 -0.586 12.212 1.00 23.92 
38.455 -0.724 13.666 1.00 24.05 
39 128 0.502 14.313 1.00 25.98 
39 288 0.390 15.841 1.00 27.50 

39 150 1.718 16.555 1.00 27.88 

40 150 2.453 16.674 1.00 29.49 

38 036 2.018 17.013 1.00 29.22 
37 076 -0.901 14.276 1.00 22.80 
36 873 -1.774 15.094 1.00 22.95 
36 129 -0.071 13.884 1.00 22.19 
34 801 -0.178 14.459 1.00 21.88 
33 940 1.077 14.196 1.00 21.85 

32 478 0.836 14.587 1.00 22.66 
34.438 2.233 15.043 1.00 22.82 

33 490 3.390 15.019 1.00 23.11 
34.080 -1.398 13.968 1.00 20.49 
33 228 -1.917 14.656 1.00 21.90 
34*410 -1.860 12.781 1.00 19.59 
33 747 -3.027 12.248 1.00 19.42 
33 758 -3.014 10.706 1.00 19.12 
33 095 -4.285 10.157 1.00 18.71 

32 987 -1.786 10.187 1.00 18.56 

33 054 -1.588 8.683 1.00 15.05 
34.305 -4.338 12.832 1.00 19.02 
33 571 -5.300 12.982 1.00 19.98 
35.565 -4.344 13.233 1.00 19.03 

36 177 -5.518 13.822 1.00 19.74 

37 614 -5.631 13.340 1.00 19.62 

38 368 -4.478 13.683 1.00 22.08 
36.135 -5.502 15.355 1.00 20.48 
36.352 -6.521 16.010 1.00 21.19 
35.866 -4.346 15.939 1.00 20.99 
35.808 -4.235 17.396 1.00 20.43 
36.705 -3.074 17.927 1.00 20.22 
36 407 -2.785 19.382 1.00 20.37 
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ATOM 1857 CG2 VAL 901 

ATOM 1858 C VAL 901 

ATOM 1859 O VAL 901 

ATOM 18 60 N GLN 902 

5 ATOM 18 61 CA GLN 902 

ATOM 18 62 CB GLN 902 

ATOM 18 63 CG GLN 902 

ATOM 18 64 CD GLN 902 

ATOM 18 65 OE1 GLN 902 

10 ATOM 18 6 6 NE2 GLN 902 

ATOM 18 67 C GLN 902 

ATOM 18 68 O GLN 902 

ATOM 18 69 N VAL 903 

ATOM 1870 CA VAL 903 

15 ATOM 1871 CB VAL 903 

ATOM 1872 CGI VAL 903 

ATOM 18 73 CG2 VAL 903 

ATOM 187 4 C VAL 903 

ATOM 1875 O VAL 903 

20 ATOM 187 6 N PRO 90 4 

ATOM 1877 CD PRO 904 

ATOM 187 8 CA PRO 904 

ATOM 187 9 CB PRO 90 4 

ATOM 18 80 CG PRO 904 

25 ATOM 18 81 C PRO 904 

ATOM 1882 O PRO 904 

ATOM 1883 N LYS 905 

ATOM 18 8 4 CA LYS 905 

ATOM 1885 CB LYS 905 

30 ATOM 188 6 CG LYS 905 

ATOM 1887 CD LYS 905 

ATOM 188 8 CE LYS 905 

ATOM 188 9 NZ LYS 90 5 

ATOM 1890 C LYS 905 

35 ATOM 1891 O LYS 905 

ATOM 1892 N ILE 906 

ATOM 1893 CA ILE 906 

ATOM 1894 CB ILE 906 

ATOM 1895 CG2 ILE 906 

40 ATOM 1896 CGI ILE 906 

ATOM 1897 CD1 ILE 906 

ATOM 1898 C ILE 906 

ATOM 1899 O ILE 906 

ATOM 1900 N LEU 907 

45 ATOM 1901 CA LEU 907 

ATOM 1902 CB LEU 907 

ATOM 1903 CG LEU 907 

ATOM 1904 CD1 LEU 907 

ATOM 1905 CD2 LEU 907 

50 ATOM 1906 C LEU 907 

ATOM 1907 O LEU 907 

ATOM 1908 N SER 908 

ATOM 1909 CA SER 908 

ATOM 1910 CB SER 908 

55 ATOM 1911 OG SER 908 

ATOM 1912 C SER 908 

ATOM 1913 O SER 908 

ATOM 1914 N GLY 90 9 

ATOM 1915 CA GLY 909 

60 ATOM 1916 C GLY 909 

ATOM 1917 O GLY 909 

ATOM 1918 N LYS 910 



38.168 -3.436 17.782 1.00 18.81 
34 397 -4.087 17.935 1.00 20.42 
33 999 -4.823 18.841 1.00 21.68 
33 614 -3.187 17.350 1.00 19.34 

32 264 -2.957 17.828 1.00 17.55 
31 929 -1.476 17.735 1.00 19.32 
32*952 -0.579 18.371 1.00 20.82 

33 089 -0.776 19.861 1.00 23.15 
32*211 -1.336 20.528 1.00 23.22 

34 197 -0.288 20.404 1.00 25.36 

31 145 -3.766 17.207 1.00 16.24 
30 337 -4.326 17.938 1.00 15.40 
31.075 -3.810 15.872 1.00 15.79 
30 025 -4.552 15.144 1.00 15.22 
30 195 -4.461 13.594 1.00 14.30 

29 159 -5.314 12.883 1.00 13.20 
30.012 -3.005 13.147 1.00 14.90 
29.860 -6.010 15.605 1.00 14.74 

28 732 -6.489 15.693 1.00 14.48 

30 976 -6.729 15.893 1.00 14.65 

32 377 -6.425 15.571 1.00 13.72 
30*884 -8.122 16.356 1.00 15.80 
32 350 -8.481 16.602 1.00 15.45 
33*014 -7.830 15.512 1.00 14.60 

30 053 -8.206 17.632 1.00 16.77 

29 151 -9.039 17.713 1.00 18.38 
30.286 -7.295 18.589 1.00 17.00 
29.525 -7.292 19.830 1.00 16.34 
29 866 -6.085 20.668 1.00 18.17 

31 293 -6.007 21.132 1.00 19.96 

31 464 -4.733 21.947 1.00 22.09 

32 911 -4.429 22.276 1.00 23.59 
33*.003 -3.173 23.083 1.00 27.13 
28 039 -7.273 19.546 1.00 15.58 
27 251 -7.817 20.297 1.00 15.43 
27 647 -6.620 18.466 1.00 15.71 
26.239 -6.554 18.086 1.00 15.74 
25 991 -5.423 17.030 1.00 14.76 

24 527 -5.427 16.565 1.00 13.47 
26.358 -4.051 17.611 1.00 13.40 
26.021 -2.876 16.686 1.00 13.18 

25 800 -7.899 17.478 1.00 17.05 

24 759 -8.471 17.834 1.00 16.35 
26.609 -8.385 16.539 1.00 17.95 
26.348 -9.631 15.827 1.00 17.64 
27 331 -9.787 14.659 1.00 15.32 
27.338 -8.653 13.632 1.00 14.36 
28.382 -8.885 12.557 1.00 12.28 

25 947 -8.531 13.029 1.00 13.60 
26.386 -10.858 16.747 1.00 18.93 
25.756 -11.860 16.437 1.00 20.36 

27 097 -10.805 17.868 1.00 19.02 
27.103 -11.947 18.772 1.00 19.60 

28 469 -12.099 19.407 1.00 19.21 
28 811 -10.944 20.135 1.00 19.20 

26 027 -11.844 19.867 1.00 20.23 
25.946 -12.709 20.752 1.00 20.86 
25.208 -10.791 19.812 1.00 19.30 
24.169 -10.603 20.817 1.00 18.22 

24 533 -9.859 22.102 1.00 16.28 
23 711 -9.712 22.987 1.00 16.09 

25 773 -9.422 22.236 1.00 16.37 



* 



ATOM 1919 CA LYS . 910 

ATOM 1920 CB LYS 910 

ATOM 1921 CG LYS 910 

ATOM 1922 CD LYS 910 

5 ATOM 1923 CE LYS 910 

ATOM 1924 NZ LYS 910 

ATOM 1925 C LYS 910 

ATOM 192 6 O LYS 910 

ATOM 1927 N VAL 911 

10 ATOM 192 8 CA VAL 911 

ATOM 192 9 CB VAL 911 

ATOM 1930 CGI VAL 911 

ATOM 1931 CG2 VAL 911 

ATOM 1932 C VAL 911 

15 ATOM 1933 O VAL 911 

ATOM 1934 N LYS 912 

ATOM 1935 CA LYS 912 

ATOM 1936 CB LYS 912 

ATOM 1937 CG LYS 912 

20 ATOM 1938 CD LYS 912 

ATOM 1939 CE LYS 912 

ATOM 1940 NZ LYS 912 

ATOM 1941 C LYS 912 

ATOM 1942 O LYS 912 

25 ATOM 1943 N PRO 913 

ATOM 1944 CD PRO 913 

ATOM 194 5 CA PRO 913 

ATOM 194 6 CB PRO 913 

ATOM 1947 CG PRO 913 

30 ATOM 1948 C PRO 913 

ATOM 194 9 O PRO 913 

ATOM 1950 N ILE 914 

ATOM 1951 CA ILE 914 

ATOM 1952 CB ILE 914 

35 ATOM 1953 CG2 ILE 914 

ATOM 1954 CGI ILE 914 

ATOM 1955 CD1 ILE 914 

ATOM 1956 C ILE 914 

ATOM 1957 O ILE 914 

40 ATOM 1958 N TYR 915 

ATOM 1959 CA TYR 915 

ATOM 1960 CB TYR 915 

ATOM 1961 CG TYR 915 

ATOM 1962 CD1 TYR 915 

45 ATOM 1963 CE1 TYR 915 

ATOM 1964 CD2 TYR 915 

ATOM 1965 CE2 TYR 915 

ATOM 1966 CZ TYR 915 

ATOM 1967 OH TYR 915 

50 ATOM 19 68 C TYR 915 

ATOM 1969 O TYR 915 

ATOM 197 0 N PHE 916 

ATOM 1971 CA PHE 916 

ATOM 1972 CB PHE 916 

55 ATOM 1973 CG PHE 916 

ATOM 1974 CD1 PHE 916 

ATOM 1975 CD2 PHE 916 

ATOM 1976 CE1 PHE 916 

ATOM 1977 CE2 PHE 916 

60 ATOM 1978 CZ PHE 916 

ATOM 1979 C PHE 916 

ATOM 1980 O PHE 916 



# 

-69- 

26.166 -8.670 23.411 1.00 17.37 

27 665 -8.464 23.403 1.00 17.13 

28 418 -9.703 23.684 1.00 16.32 

29 860 -9.370 23.896 1.00 17.48 

30 577 -10.534 24.482 1-00 17.90 
32 055 -10.284 24.502 1.00 19.93 
25 472 -7.296 23.532 1.00 18.68 
25*250 -6.797 24.640 1.00 19.51 
25*219 -6.641 22.397 1.00 18.99 
24*545 -5.341 22.396 1.00 17.49 
25 501 -4.130 22.041 1.00 17.52 
26*928 -4.550 22.019 1.00 15.48 
25 094 -3.412 20.788 1.00 15.48 
23 379 -5.475 21.458 1.00 17.48 
23*504 -6.015 20.358 1.00 17.40 
22 219 -5.032 21.896 1.00 17.96 
21 057 -5.210 21.072 1.00 19.32 
20.189 -6.325 21.672 1.00 21.17 
19 261 -5.889 22.811 1.00 25.24 
19 998 -5.297 24.030 1.00 26.63 

19 509 -3.871 24.370 1.00 26.56 
18.028 -3.782 24.457 1.00 27.08 
20.262 -3.943 20.903 1.00 19.72 

20 437 -2.985 21.651 1.00 19.46 
19 463 -3.877 19.841 1.00 20.13 
19 437 -4.660 18.599 1.00 20.38 
18 693 -2.665 19.683 1.00 20.09 
18 174 -2.780 18.259 1.00 20.97 
18 127 -4.240 18.017 1.00 21.02 
17 555 -2.665 20.658 1.00 20.77 
17 108 -3.719 21.120 1.00 20.82 
17 094 -1.460 20.972 1.00 20.62 
15*965 -1.262 21.846 1.00 18.90 
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6, 
5. 

6. 
4 . 

4 . 

5, 



8.300 ■ 
7.354 
. 549 
, 921 
. 440 
. 614 
.350 
.446 
8.077 
9. 185 
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